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Abstract 



We present a unified large system analysis of linear receivers for a class of random matrix channels. 

O | ■ The technique unifies the analysis of both the minimum-mean-squared-error (MMSE) receiver and the 

adaptive least-squares (ALS) receiver, and also uses a common approach for both random i.i.d. and 

random orthogonal precoding. We derive expressions for the asymptotic signal-to-interference-plus- 

■^j" . noise (SINR) of the MMSE receiver, and both the transient and steady-state SINR of the ALS receiver, 

O ' 

■ trained using either i.i.d. data sequences or orthogonal training sequences. The results are in terms of 



key system parameters, and allow for arbitrary distributions of the power of each of the data streams and 
the eigenvalues of the channel correlation matrix. In the case of the ALS receiver, we allow a diagonal 



q , loading constant and an arbitrary data windowing function. For i.i.d. training sequences and no diagonal 

loading, we give a fundamental relationship between the transient/steady-state SINR of the ALS and 
' the MMSE receivers. We demonstrate that for a particular ratio of receive to transmit dimensions and 

b ! 

window shape, all channels which have the same MMSE SINR have an identical transient ALS SINR 
response. We demonstrate several applications of the results, including an optimization of information 
throughput with respect to training sequence length in coded block transmission. 

Index Terms 

Large System, MMSE, Recursive Least Squares, MIMO, CDMA. 

* Matthew Peacock is supported in part by the Australian CSIRO 

* Supported by the U.S. Army Research Office under DAAD19-99-1-0288 and the National Science Foundation under grant 
CCR-03 10809 



Submitted to IEEE Trans. Info. Theory - Feb. 05, Revised Aug. 05 



2 



I. Introduction 

Large-system analysis of linear receivers for random matrix channels has attracted significant 
attention in recent years, and has proven to be a powerful tool in their understanding and design 
(e.g., see [1] and references therein). In particular, large-system analysis of the matched filter, 
decorrelator, and minimum-mean- squared-error (MMSE) receivers, which have knowledge of 
the channel state information, has been exhaustively studied for the downlink, using results 
such as the Silverstein-Bai theorem [2], Girko's law [3], and free probability [4]. In this paper 
we take a different approach to the problem, which allows us to consider random i.i.d. and 
orthogonal channels (or matrix of signatures) in the same treatment, in contrast to the existing 
separate analyses of i.i.d. [5] and orthogonal [6, 7] channels. The new approach also allows us to 
consider a more general class of signal models, and a receiver which does not have the benefit 
of channel state information, namely the adaptive least-squares (ALS) receiver. 

In this paper, we consider both the MMSE and ALS receivers. The ALS receiver approximates 
the MMSE receiver and requires training symbols [8]. In particular, the autocorrelation matrix of 
the received vector (an ensemble average), which is used in the MMSE receiver, is replaced in 
the ALS receiver by a sample autocorrelation matrix (a time average). Given sufficient training 
symbols, the performance of the ALS receiver approaches that of the MMSE receiver. In this 
paper, two types of adaptive training modes are considered, based on those presented in [9], where 
either the training sequence is known at the receiver, or a semi-blind method is employed. In a 
time- varying environment, weighting can be applied to the errors to create data windowing, which 
allows for tracking. When implemented online as a series of rank-1 updates with exponential 
windowing, this receiver is often referred to as the recursive least-squares (RLS) receiver [8]. 
To prevent ill-conditioning of the sample autocorrelation matrix with RLS filtering, diagonal 
loading can be employed, which refers to initializing this matrix with a small positive constant 
times the identity matrix. 

Prior relevant work on ALS techniques (in particular, as applied to channel equalization and 
estimation) includes [10], where an approximate expression is derived for the transient excess 
mean- squared-error (MSE) of the ALS receiver with respect to the MMSE receiver, often referred 
to as self-noise, for a general channel model and windowing function. Also, an approximate 
expression is given for the convergence time constant of the receiver when tracking non-stationary 
signals. Steady-state and transient analysis of the RLS receiver was also considered in [11]. A 
comprehensive treatment of ALS techniques and variations is contained in [8]. 
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The application of ALS to CDMA was considered in [12], where the convergence of a blind 
multiuser detector based on a stochastic gradient-descent adaptation rule is also analyzed. In [9], 
approximate expressions are given for the relationship between the MMSE signal-to-interference- 
plus-noise-ratio (SINR) and the steady-state ALS SINR with exponential windowing for DS- 
CDMA in flat-fading. This was extended in [13] to a steady-state analysis of two-stage algorithms 
based on RLS using decision-directed adaptation. A review of adaptive interference mitigation 
techniques is given in [14]. 

Large-system analysis for ALS receivers was first considered in [15, 16], which considered 
DS-CDMA in flat fading with i.i.d. training sequences, and an ALS receiver with diagonal 
loading and rectangular or exponential weighting. The transient SINR (i.e., after a given number 
of training symbols) and steady-state SINR (i.e., after an unlimited number of training symbols) 
of the ALS receiver was derived in the limit where the number of transmit dimensions, the 
number of multiplexed data streams, and the number of training symbols all tend to infinity with 
fixed ratios. With rectangular windowing and no diagonal loading, a relationship between the 
MMSE SINR and the transient ALS SINR was given. In other work, large-system analysis has 
been applied to so-called subspace-based blind ALS receivers in [17-19]. 

In this paper, we consider a more general complex- valued AWGN matrix-vector channel 
model of the form r = HSAb + n, where H is an arbitrary matrix, A is an arbitrary diagonal 
matrix, and S is either an i.i.d. or orthogonal matrix. For example, the model applies explicitly to 
downlink synchronous direct sequence (DS) or multi-carrier (MC) CDMA in frequency- selective 
fading, as well as multi-input multi-output (MIMO) channels. In fact, if we additionally require 
that the data vector b is unitarily invariant, then the MMSE and ALS results we obtain apply 
to all AWGN matrix-vector systems of the form r = Cb + n, provided that the eigenvalues of 
CC^ converge to a well-behaved deterministic distribution. As such, the results can be applied to 
systems not previously considered in large-system analysis, such as equalization of single-user 
finite-impulse-response (FIR) channels. 

For this channel model, we derive the SINR of the MMSE receiver, with either i.i.d. or 
orthogonal S, in the large-system limit where the number of transmit and receive dimensions, and 
the number of multiplexed data streams all tend to infinity with fixed ratios. The expression for 
the SINR is a function of these ratios and the received SNR, and allows for arbitrary asymptotic 
eigenvalue distributions (a.e.d.'s) of AA* and HHt This result can also be derived under the 
same set of assumptions using the S-transform from free probability. However, unlike the free 
probability technique, we show that the technique used to derive this result also applies to the 
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ALS receiver. 

For the ALS receiver, we extend the work of [16] to the general channel model described. We 
consider an arbitrary data windowing function and both i.i.d. and orthogonal training sequences. 
That is, we determine both the transient and steady-state ALS SINR in the limit described for 
the MMSE receiver, and also as the number of training symbols tend to infinity. 

Also, we present an expression which relates the SINR of the MMSE receiver to the transient 
and steady-state SINRs of the ALS receiver for the case of i.i.d. training sequences and no 
diagonal loading. We demonstrate that in this situation, for a particular ratio of receive to 
transmit dimensions and window shape, all channels, which have the same MMSE SINR, have 
an identical transient ALS SINR response. Since our results hold for all well-behaved matrix- 
vector systems for which the data vector is unitarily invariant (as previously discussed), the 
MMSE-ALS relationship is in fact seen to be & fundamental property of adaptive least-squares 
estimation. 

It is interesting to compare our results to an approximate expression for the steady-state ALS 
SINR given in [9] for the special case of DS-CDMA in a flat-fading channel, with i.i.d. spreading 
and exponential weighting. A comparison of the expressions reveals the approximation in [9] to 
be excellent, particularly for large window sizes. Also, we note that for a general channel model, 
the study in [11] previously came to the conclusion that the ALS convergence rate is independent 
of the channel; however, their conclusion was based on making several approximations, and only 
exponential windowing (RLS) was considered. This conclusion was stated for DS-CDMA in [9]; 
however, an explicit relationship between the MMSE SINR and the transient ALS SINR, such 
as derived in this paper, is not given in either of those papers. 

Unfortunately, we have not determined a simple relationship between the MMSE SINR and 
ALS SINR with orthogonal training sequences and/or diagonal loading. This remains an open 
problem. Inspection reveals that with orthogonal training sequences and no diagonal loading, the 
transient relationship will depend on the channel. 

During the course of the analysis we solve for the Stieltjes (or Cauchy) transforms of the 
a.e.d.'s of both the autocorrelation matrix and the weighted sample autocorrelation matrix of the 
received signal. These expressions are found using matrix manipulations, which do not require 
the use of free probability. The first transform can also be derived using free probability; however, 
such techniques cannot be applied to derive the second transform, since the associated constituent 
matrices are not free. As such, the results are of independent mathematical interest. 

Through numerical studies, we demonstrate the applicability of the large-system results to finite 
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systems, and the benefits of orthogonal precoding and training is examined. We demonstrate an 
application of the results in the optimization of information throughput with respect to training 
sequence length in coded block transmission. 

The paper is arranged as follows. Section II outlines the general transmission model, the 
receivers considered, and defines the large-system asymptotic limit. Section III discusses the 
general approach we take for the analysis. Section IV reviews analytical approaches to the MMSE 
SINR problem, and presents the general solution based on the unified analytical approach. An 
alternate expression for the MMSE SINR, different from those presented in [1], is also presented 
which allows the relationship between the MMSE SINR and ALS SINR to be derived in certain 
cases in later sections. Section V presents the general result for the ALS SINR. A simple 
relationship between the ALS SINR and MMSE SINR with i.i.d. training and without diagonal 
loading is then presented in Section VI. Finally, numerical studies are presented in Section VII. 

II. System Model 

This paper considers a general matrix-vector transmission model. It applies to a wide range 
of practical data communication systems, including frequency-flat fading MIMO channels, and 
both frequency-flat and frequency-selective fading downlink DS- or MC-CDMA channels. 

A. General Transmission Model 

In matrix notation 1 , the received signal in the m th symbol period is 

r m = HSAb m + n m (1) 

where 

• H is an M x N complex- valued channel matrix. 

• S = [si ■ ■ • s K ] is an N x K complex-valued matrix which contains either 

- random orthonormal columns, i.e., we assume that S is obtained by extracting K < N 
columns from an N x N Haar-distributed 2 unitary random matrix, or, 

'Notation: All vectors are defined as column vectors and designated with bold lower case; all matrices are given in bold upper 

case; (-) T denotes transpose; denotes Hermitian (i.e. complex conjugate) transpose; (•)* denotes the operation X* = XX*; 

tr[-] denotes the matrix trace; |-| and ||-|| denote the Euclidian and induced spectral norms, respectively; Ijv denotes the N x N 

identity matrix; I' = {i £ 1 : x > 0}; and, expectation is denoted E[-]. 

2 A unitary random matrix fl is Haar distributed if its probability distribution is invariant to left or right multiplication by any 

constant unitary matrix. If X is a square random matrix with i.i.d. complex Gaussian centered unit variance entries, then the 

unitary matrix X (X^X) 12 is Haar distributed. 
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- i.i.d. complex elements 3 with mean zero and variance 4. For example, i.i.d. real & 
imaginary parts which are either ±l/V2iV with equal probability, or i.i.d. Gaussian 
with zero mean and variance \/{2N). 

We shall call the first case 'isometric S', and the second case 'i.i.d. S', as is done in [6]. 

• A is a KxK, diagonal, complex- valued matrix of transmit coefficients, i.e. A = diag(Ai, . . . , Ak). 
In fact, the results which follow depend only the values of P k = \A k \ 2 , and so to simplify 
notation, without lack of generality, we may assume A^, k = 1, . . . , K, is non-negative and 

real valued. 

• The complex vector b m is K x 1 and either contains transmit data, or training symbols (for 
an ALS receiver). Elements of b m can be either i.i.d. with zero mean and unit variance 3 
(for data and i.i.d. training cases), or they can be drawn from a set of orthogonal sequences 
(for orthogonal training, as explained further in Section II-D). 

• n. m contains i.i.d., zero mean, circularly symmetric, complex Gaussian entries with variance 
per dimension a 2 /2. 

• H, S, A, b m , and n m are mutually independent. 

B. Discussion 

The general transmission model in (1) is widely applicable, and in particular includes the 
following systems. 

• Downlink MC- or DS-CDMA: In this case, S represents the matrix of K signatures with 
spreading gain N. Typically, the number of output dimensions equals the number of input 
dimensions and hence H is square. 

- For flat-fading DS-CDMA, H = Ijv and A represents the combined effect of each 
users' transmit power and channel coefficient. 

- For MC-CDMA in frequency- selective fading, H is the diagonal matrix of the channel 
frequency response in each subcarrier, and A represents the transmit amplitude of each 
signature. 

- For DS-CDMA in frequency-selective fading, H is a circulant or Toeplitz matrix 
constructed from the channel impulse response, and A represents the transmit amplitude 
of each signature. 

3 For technical reasons, we also require the elements have finite positive moments. Also, if H ^ Ijv, we additionally require 
that S is unitarily invariant, although we believe the results apply more generally. 
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• 'Rich' MIMO: The standard point-to-point flat fading MIMO channel model is given by (1), 
where K and N correspond to the number of transmit and receive antennas, respectively. 
The standard MIMO channel matrix with i.i.d. circularly symmetric complex Gaussian 
coefficients between each pair of transmit and receive antennas corresponds here to setting 
H = I^r and S i.i.d. Gaussian. The matrix A defines the transmit amplitudes on each 
antenna. 

• MIMO with richness parameter: The MIMO channel model introduced in [20] can also be 
described by (1), with S i.i.d. and H = <E»C, where «J> is i.i.d., and C is diagonal. In this 
case, S models the propagation from the transmitter to a 'scattering array', modeled by C, 
and <J> models the propagation from the scattering array to the receiver. The rank of the 
scattering array matrix determines the richness of the MIMO channel. 

• If we additionally require that the data vector b is unitarily invariant, then the MMSE and 
ALS asymptotic SINR results we obtain apply to all AWGN matrix-vector systems of the 
form r = Cb + n, under certain conditions on the channel matrix C, and the data and noise 
vectors, b and n. A full explanation is given in Section II-E. 

C. MMSE Receiver 

The output of the MMSE receiver with full channel state information (CSI) and knowledge 
of S for stream k at symbol interval m is given by 

b m (k) = cjx m (2) 

where 

c k = R^Hsfc (3) 

R = (HSA)* + a 2 I M (4) 

Now, identifying the signal and interference components of the received signal in (1), i.e., r m = 
A k Hs k h m (k) + r^, the corresponding output SINR is defined as 

CTNTdMMSE E[l C fe( r m — r m)| 2 ] r c\ 

where the expectation in (5) is with respect to b m and n m . The subscript N indicates that this 
is a non-asymptotic quantity. 
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D. ALS Receiver 

The output of the adaptive least-squares (ALS) receiver with i training symbols for stream k 
at symbol interval m > i is given by 

b m (k) = c\v m (6) 

where 

c fc = R _1 s fc (7) 
R = -KWR) + -1 M (8) 

l 7] 

TZ = HSAB f + N (9) 

{■jlZWh k , with training 
(10) 
Hsfc , semi-blind 

and where 

• B is an i x K matrix of training data, where the m th row of B is b^. The k th column of 
B will be denoted as b k . We consider both i.i.d. and orthogonal training sequences, i.e., 

- B contains i.i.d. elements with zero mean, unit variance, and finite positive moments, 
or, 

- B contains either random orthogonal rows or columns. If K < i then B^B = Hk, 
and we assume that ^B is obtained by extracting K columns from an i x i Haar- 
distributed unitary random matrix. If K > i, then BB^ = A'lj, and we assume that 
^7=Bt is obtained by extracting i columns from a K x K Haar-distributed unitary 
random matrix. 

• N is an M x i matrix of noise, where the m th column is n. m . 

• /i is a real- valued non-negative diagonal loading constant, i.e., [i 6 R*, and r] = i/N. 

• W is an i x i diagonal real-valued data windowing matrix, i.e. W = diag(twi, . . . , Wi) 
where w m G R* for m = 1, . . . , %. For example, with exponential weighting w m = e l ~ m , 
where e G (0, 1], or, without data windowing W = Ij. 

Note that, although strictly speaking the model in (1) applies only to time-invariant systems, 
we include windowing to allow for practical situations such as slowly time varying channels, 
or users entering/leaving the system. The term 'semi-blind' in (10) refers to the case when H 
and s fc are known, and there is no training data, whereas 'with training' refers to when just 
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B is known. For more details on the practical issues, see [9], where this ALS formulation is 
considered for DS-CDMA in flat-fading. 

The SINR for the k th stream at the output of the ALS receiver, SINR^, is defined by the 
right-hand side of (5), however, with c k replaced by c k . 

E. Large System Limit 

We define a = K/N and (3 = M/N, and for the ALS receiver, r] = i/N. 

Throughout this paper we consider the asymptotic limit (M, N, K, i) — > oo with K/N — > a > 
0, i/N -> i] > 0, and M/N -> f3 > constant. 

With data windowing it is necessary to consider how W is defined for each i so that the 
empirical distribution function (e.d.f.) of its diagonal values converges to something appropriate. 
Any finite window length becomes negligible in the large system limit as i — > oo, therefore 
it is necessary to scale the window shape with the system size. For example, as in [16], with 
exponential windowing we define L = as the 'average' window length, and take L — > oo 
with L/N — > L > constant. 

To facilitate the large system analysis, we also require that H*, A 2 , and W each have a 
uniformly bounded spectral norm, 4 that is, a bound which is independent of the system dimension 
N. Also, we require the empirical distribution functions of the eigenvalues of A 2 , H*, and W 
to converge in distribution almost surely to non-random distributions on the non-negative real 
axis, which will have compact support due to the previous assumption. We also assume that the 
limiting distributions of A 2 , H*, and W are non-trivial, i.e., do not have all mass at zero. 

Proposition 1: The large-system MMSE and ALS SINRs corresponding to the transmission 
model in (1) are the same as the large-system MMSE and ALS SINRs, respectively, computed 
for any matrix-vector system of the form r = Cb + n in which 

• C, b, and n are mutually independent, 

• the M x 1 noise vector n satisfies the same conditions as n m above, 

• the K x 1 data/training vector b satisfies the same conditions as b m above, and additionally 

4 In particular, this condition is required for the derivations in the appendices, which frequently rely on Lemma 9 in Appendix 
I along with other key lemmas as a precursor to the asymptotic analysis contained in the remaining appendices. 
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is unitarily invariant, 5 

• the M x K channel matrix C is such that the e.d.f. of C* satisfies the conditions on H* 
mentioned in the previous paragraph, 
and corresponds to taking H = C, S to be K x K isometric, and A = Ik, respectively. 

Proof: Since b is unitarily invariant, all data streams have identically distributed SINRs. 
Since b has the same distribution as Ub, where U is a K x K Haar-distributed random unitary 
matrix, we see that the distribution of the MMSE and ALS SINR associated with r' = CUb + n 
are, respectively, the same as that for r, and will share a common large-system limit, if it exists. 

■ 

Proposition 1 implies that our model encompasses the classic equalization model. Namely, r 
represents N = M samples at the output of a single-input/single-output (SISO) FIR channel h 
of length L c , i.e., r(n) = Yl^^o 1 h(£)b(n — t) + n(n). If a cyclic prefix of appropriate length is 
used, we set C defined in Proposition 1 equal to the circulant channel matrix. Therefore, from 
Proposition 1, the corresponding model (1) takes A = Ijy, SasiVxiV isometric, and H as C 
or equivalently as an iV x N diagonal matrix with the iV-point DFT of h on the diagonal. 

III. Unified Large System Analysis 

In Sections IV and V, we derive the asymptotic SINR for the model (1) with both MMSE and 
ALS receivers. The SINR in both cases is directly related to the Stieltjes transform 6 of the a.e.d. 
of the received signal correlation matrix R for the MMSE receiver, and R for the ALS receiver. 
For the MMSE case, there are a number of existing methods for finding such transforms directly 
(see e.g., [1]). However, those methods do not extend to the ALS problem. We now discuss a 
general approach, which applies to both i.i.d. and isometric S for both the MMSE and the ALS 
receiver. 

The aim is to derive a set of equations for each "constituent" dimension in R (or R), which 
can be solved for the Stieltjes transform. For example, R has three constituent dimensions (K, 
N, and M), while R has four, since it also includes i. 

5 That is, the elements of Ub have the same joint distribution as b for any K xK unitary matrix U. This is an extra restriction 
on i.i.d. data vectors. For training vectors from Haar-distributed matrices, this condition is automatically satisfied, and is easily 
verified for a > r\ (that is, when b is a column from a Haar-distributed matrix). For a < 77, note that Ub corresponds to the 
Hermitian transpose of a row of BU^, and BU* can be written as ©EjfU', where is i x i Haar, and Ek = [Ik, Ok,i-k]^- 
Alternately, we have BU* = ©UEk, where U is the unitary matrix created by replacing the upper left K x K sub-block of 
I, with U*. Since © is Haar, ©U is also Haar, and hence BU* has the same distribution as B. 

6 The Stieltjes (or Cauchy) transform of the distribution of a real-valued random variable X is the expected value of 1/ (X — 2), 
where z 6 C + is the transform variable, and C + = {x \ x £ C, lrn(x) > 0} (e.g., see [2]). 
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Each equation is based on expanding the simple identity R _1 R = 1m (or R _1 R = 1m) 
in each constituent dimension. That is, since R (or R) is Hermitian, this term in the identity 
can be written as a sum of vector outer products, where the sum index runs up to the value 
of the dimension. Taking the normalized trace of both sides of the resulting equation can be 
simplified, and involves terms which have equivalent asymptotic forms, which can be evaluated 
using an asymptotic extension to the matrix inversion lemma. These equivalent forms are in 
terms of scalar variables, some of which are mixed matrix moments. Each of these moments 
can be expressed in terms of the other variables. 

The result is a set of equations, which can be solved for the Stieltjes transform, and other 
unknowns (e.g., various matrix moments). Interestingly, in all cases we consider, the equations 
can be written in a form such that solving for the Stieltjes transform numerically amounts to 
zero-finding in at most two dimensions. 

IV. Analysis of MMSE Receiver 

It has been shown that, for both i.i.d. and isometric S, the asymptotic SINR for the k th stream 
at the output of the full-CSI MMSE receiver in (3) satisfies [7,21] 

max |SINR*™ SE - P k p»\ ^0 (11) 

k<K 1 ' 1 

under the limit considered, where ■—> denotes almost-sure convergence, 

N UtrpHtR-iH] , i-i.d. S, 

Pi — \ UA) 
I ^trpHtR^H] , iso. S, 

and II = Ijv - S*. 

We now discuss some existing methods for computing the limit of the moment p^ in (12) with 
the MMSE full-CSI receiver, and note that the methods do not extend to the ALS receiver. We 
then present the main result of this section, namely, a general SINR expression, which applies 
to both i.i.d. and isometric S, derived using the approach discussed in Section III. 

For i.i.d. S, and square invertible H, the SINR can be obtained in terms of the limiting 
distribution of and A 2 , using the result of Silverstein and Bai [2] after writing 

pf = ^tr[(SA 2 St + cr 2 (H t H)~ 1 ) -1 ] (13) 

as was done in [22], and for more general channel distributions in [6, Theorem 2]. More generally, 
a solution for arbitrary (non-square) channel models can be obtained for i.i.d. S via Girko's law 
(see e.g., [5, Lemma 1], or [23, Theorem IV.2] for just the Stieltjes transform of R), again in 
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terms of the limiting distributions of H* and A 2 . We note that neither of these techniques can be 
used to compute the output SINR for the MMSE receiver with isometric S, or the ALS receiver. 

For isometric S and square H, the asymptotic SINR was first presented in [6] with A = Ik, 
and was extended in [7] to include general A. Although this approach could also be used to 
consider non-square H, it does not extend to the ALS receiver since it relies on the particular 
structure of R, which is not shared by R. 

A. Asymptotic SINR for MMSE Receiver 

The following theorem allows us to evaluate the limit of p± (and hence the asymptotic MMSE 
SINR) for general channels and for either i.i.d. or isometric S. The theorem is in terms of the 
Stieltjes transform of the e.d.f. of the eigenvalues of (HSA)*. That is, we generalize the definition 
of R from (4) by replacing a 2 by a complex variable z E C + (i.e., R = (HSA)*— zIm), such that 
the Stieltjes transform of the e.d.f. of the eigenvalues of (HSA)* is given by (z) = ^jtr[R -1 ]. 
The theorem is given in terms of the two additional random variables p^ E C + , as defined in 
(12) using the redefinition of R, and E C + . The variable rf is defined in terms of matrix 
equations, and is given in Appendix II- A since the definition is lengthy and is not needed to 
state the following result. 

Theorem 1: Under the assumptions in Section II-E, as (M, N, K) — > oo with M/N — > j3 > 
and K/N — > a > fixed, the Stieltjes transform of the e.d.f. of the eigenvalues of (HSA)*, 
(z), z E C + , along with pf and rf satisfy 




7i 



a.s. 



0. 



(14) 



Pi 



N 



Pi 



a.s. 







(15) 



7"i 



A 1 



a.s. 



0. 



(16) 



where 7i,pi,Ti E C + are solutions to 




(18) 



(17) 



(19) 
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where (3* = mm((3, 1), and 



E 



pm 



H 



(20) 



E 



1 + Hz~ 1 {tx - ap) 



(21) 



for m G Z*. The expectations in (20) and (21) are with respect to the scalar random variables P 
and H, respectively, and the distributions of P and H are the asymptotic eigenvalue distributions 
of A 2 and the first (3*N = min(M, N) non-zero eigenvalues of H*, respectively, and p = E[P]. 



• If (17)— (19) has a unique solution 7 7i,pi,Ti G C + for any given z G C + , then Theorem 1 
additionally implies that the e.d.f. of the eigenvalues of (HSA)* almost surely converges 
in distribution to a deterministic distribution, whose Stieltjes transform is G R (z). 
Moreover, we have that p^ converges almost surely to the deterministic value p\ in the 
limit considered, and so, letting z = —a 2 + ej and taking e — > 0, as suggested by (11), the 
asymptotic SINR of the k th data stream almost surely converges to PkPi- 

• For i.i.d. S, Theorem 1 can be obtained via Girko's law (see e.g., [23, Theorem IV.2]). For 
isometric S, this result appears to be new. However, in both cases Theorem 1 can be derived 
(under the same set of assumptions) using the S-transform from free probability, or the 
method of [24]. We give a different proof, relying only on elementary matrix manipulations. 
The primary reason for presenting this result is to lead into the ALS analysis, which will 
follow the same general procedure outlined in the proof of Theorem 1 in Appendix II. 

• The following steps describe how to find Gr(z) numerically via Theorem 1 for specific 
distributions of P and H, and given values of z, a, and (5. 

- Consider (18) as a scalar function of p\, i.e., X{p\) = 0. 

- Numerically find the unique positive root of X(p t ) using standard techniques (e.g., us- 
ing a routine such as fzero in Matlab), where, for a given value of p\, the corresponding 
values of T\ and 71 are directly evaluated using (19) for n, and (17) for 71. 

• In fact, (17) is just one of many possible expressions which can be derived from the identity 

'Certainly, for i.i.d. S, the equations of Theorem 1 have a unique solution, since the same result can be obtained via Girko's 
law, for which the solution is known to be unique. Although it is not proved, we believe that this is also true for isometric S. 
Numerical studies support this. 



Proof: See Appendix II. 



Remarks: 
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-jgtrfRR x ] = 1. Other expressions involving 71 derived in this manner include 

0(1 + 271) = a(l - £0,1) = a Pl S 1A (22) 
= (3*(1- W ,i) = Pz-^n - ap)Hi A (23) 

These expressions, derived in Appendix II, are used in the proof of Theorem 1. 
• If H is exponentially distributed with mean one (i.e., MC- or DS-CDMA in frequency- 
selective Rayleigh fading), H\ : i = (1 — l(x))/x, where x = z~ x (t\ — ap), and i(x) = 
x~ l exp (x -1 ) Ei (x~ r ) where Ei(x) = e~ xt t~ l &t is the first-order Exponential Integral. 



B. Alternate Representation of MMSE SINR 

We now present an alternate expression for the asymptotic value of SINR^ SE , which will 
allow us to determine the relationship between the asymptotic MMSE SINR and the asymptotic 
ALS SINR considered later in Sections V-VI. This expression depends on the additional random 
variables p^ E C + , j = 2,3,4, which, along with the auxiliary random variables E C + , 
j = 2,3, are defined in terms of matrix equations in Appendix II- A (The definitions of these 
variables are lengthy, and are not needed to state the following result; so to facilitate the flow 
of results they are not stated here.) 

It is shown in Appendix IV that, under the assumptions in Section II-E, 



max 

k<K 



sinrHT 



N I 







(24) 



as (M, N, K) -> 00 with M/N -> /3 > and K/N -> a > 0. Moreover, \pf - Pj \ ^ 0, 
j = 2, 3, 4, and — tj | 0, j = 2, 3, in the limit considered, where pj E C + , j = 2, 3, 4, 
and Tj E C + , j = 2, 3, are solutions to the following set of equations. For i.i.d. S, 

' (3*\z\~ 2 (n h2 + T 2 n 2 , 2 ) , j = 2, 

<,(3*\zr 2 (l + T 3 )H 2 , 2 , j = 3, (25) 

ap3^i,2 , j = 4. 
Tj = apj£ 2 , 2 , j = 2, 3. (26) 
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and for isometric S, 



P*\Z\~ 2 (Wl,2 + T 2 W 2|2 ) 



< p* \z\~ 2 { 
a(p3^i,2 



r 3 )H 



2.2 



I I 2 c 
Pi f 



1,2J 



, 3 



2. 
3. 
4. 




(27) 



2. 
3. 



Also, 



£ 



P' 



1+1 



m,2 



7i 



m,2 



(28) 

(29) 
(30) 



|1 + Hz' 1 {t 1 -ap)\ 2 
for m e Z, and px,T\,p,(3*,P and if are determined from Theorem 1. 

Again, assuming that (17)— (19) and also (25)-(28) have unique solutions, then we also have 
that the SINR of the k th data stream converges almost surely to the deterministic quantity 

Pklplf /(P4 + (T 2 P2). 

V. Analysis of ALS Receiver 

In this section we derive the asymptotic SINR for the adaptive receiver, using the general 
approach discussed in Section III. 

Firstly, we derive the asymptotic transient ALS SINR after a specified number of training 
intervals (either with training, or semi -blind). Our aim is to characterize the typical transient 
response of the receiver as a function of 77, i.e., as the number of training symbols increases. 
The resulting expression is in terms of several large matrix variables involving the sample 
autocorrelation matrix. We present a theorem which gives the Stieltjes transform of the a.e.d. 
of the sample autocorrelation matrix, and fixed-point expressions for each variable required to 
compute the asymptotic SINR. 

Secondly, from the transient SINR solution we determine the steady-state asymptotic ALS 
SINR, that is, the SINR as the number of training intervals (either with a training sequence, or 
semi-blind) goes to infinity (i.e., 77 — > 00). Without data windowing, we verify that the solution 
for the ALS SINR converges to the MMSE SINR. Then we determine the steady-state SINR 
when an arbitrary windowing function is used. 
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A. Transient ALS SINR 

The following result relates the transient SINR of the ALS receiver to six auxiliary random 
variables, and ^ £ C + , j = 1, 2, 4. The definitions of these variables are in terms of matrix 
traces and quadratic forms, and are quite lengthy. So, for clarity of presentation, and also since 
asymptotically equivalent values of these variables can be calculated from subsequent results, 
the definitions are given in Appendix VTA. 

Theorem 2: In the limit as (M, N, K, i) -> oo with M/N -> f3 > 0, K/N -> a > 0, and 
i/N —> 7] > fixed, 

P \n I 2 \n N \ 2 



max 



SINR^ 



|a M | 2 {Pa + ° 2 $) + M « + ^ 2 V#) 



a.s. 



(31) 



where 



(1, -A k p? ) , semi-blind LS 

(a k ,i, a fc>2 ) = <( a (32) 

(y4 fc (tZ; — //Sf), 1) , LS with training, 

w is the mean of the a.e.d. of W, and the definitions of and ^ , j = 1, 2,4 are given in 
Appendix VTA. 

Proof: See Appendix VII. Note that the definitions and derivations of Appendix VI 
necessarily precede Appendix VII. ■ 
Remarks: 

• Expressions, which can be used to compute asymptotically equivalent values of and ^ , 
j = 1, 2, 4, are presented in Theorem 3 and Lemma 1. 

• The preceding ALS SINR expression resembles the alternate MMSE SINR expression (24) 
derived in Section IV-B. However, a simplified expression for the ALS SINR, such as that 
presented for the MMSE SINR in Theorem 1, is not possible. This is due to the fact that a 
simplification of the interference power, as discussed in the proof of (24) in Appendix IV, 
is not possible for the ALS receiver. 

The following Theorem and Lemma give a sufficient number of relations to calculate the 
asymptotic moments required for the asymptotic SINR in (31) of Theorem 2. 

In a similar manner to Section IV, firstly we determine expressions for the Stieltjes transform 
of the e.d.f. of the eigenvalues of the sample autocorrelation matrix. That is, we generalize 
the definition of R as follows, R = — z! M , where z e C + , such that the Stieltjes 

transform of the e.d.f. of the eigenvalues of hlZWTV is given by G£(z) = -jjtrfRr 1 ]. The 
result is necessarily stated in terms of the additional random variables pf, ff, , u^, z>f , 



Submitted to IEEE Trans. Info. Theory - Feb. 05, Revised Aug. 05 



17 



and ff G C + . As in Theorem 2, the definitions of these variables are in terms of matrix traces 
and quadratic forms, and are lengthy. To facilitate the presentation of results, the definitions of 
these variables are given in Appendix VI-A. 

Theorem 3: Under the assumptions in Section II-E, as (M, N, K, i) — > oo, with M /N — > (3 > 
0, K/N — > a > 0, and i/N — > rj > fixed, the Stieltjes transform of the e.d.f. of the eigenvalues 

, along with p±, ff , $f , u)f , z>f , and ff G C + satisfy 



of \nWTZ\ G?{z), z € 



Ti 



0. 



(33) 





-Pi 


a.s 


►o, 


Iff 


- fl 


^0, 






a.s 


►o, 










- £l 


a.s 




Iff 


- n 


^0. 



(34) 
(35) 
(36) 



where 71, p\, f x , -0i, u>i, z>i, and f x G C + are solutions to 



a 



71 = z>i ( 1 + -7jPi(ipi - w)S hl 



(37) 



Pi 



1 - (3(1 - 7i/z>i) 



, i.i.d. S, 
, iso. S, 



(38) 



(3* = min(/3, 1), and 



a(V>i - u>)£i,i 



1.1 



l-/3(l-7iM) 



i.i.d. S, 
iso. S, 



(39) 



Wi,i 

^1 = w - { Wn 



Ml 



l-^(l-7iM) 



, i.i.d. B, 
, iso. B. 



a 



1.1 



i-#(i-tim; 



a 2 Wi 1 - z 



P 2~ 

r 1 = u)\ H — a 71 
V 



, i.i.d. B, 
, iso. B. 



(40) 



(41) 

(42) 
(43) 
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max(?7, a), and 



£ 



m,l 



n 



E 

E 
E 



i - Ppx{i)\ - w) 

1 — Hi>i(T\ — apw) 
W m 



(44) 

(45) 
(46) 



1 + Wfx. 

for m G Z*. The expectations in (44), (45), and (46) are with respect to the scalar random 
variables P, H, and W, respectively, where the distributions of P, H, and W are the a.e.d.s of 
A 2 , the first (3*N eigenvalues of H*, and W, respectively. Also, p = E[P] and w = E[W]. 

Proof: See Appendix VI. ■ 
Remarks: 

• If (37)-(43) has a unique solution 71, p\, t\, ipi, i>\, f\ G C + for any given z G C + , then 
Theorem 3 additionally gives that the e.d.f. of the eigenvalues of VlV'NV) almost surely 
converges in distribution to a deterministic distribution, whose Stieltjes transform is G^(z). 

• Note that for i.i.d. B and fi — * (i.e., i.i.d. training sequences and no diagonal loading), 
Wi,i7i, VV^ipi, and VVi,i(fi — apw) satisfy the same equations as 7 X , p u and (t\ — ap) 
from Theorem 1. Moreover, due to (50), we have that W^i is a function of only (3, r], and 
the window shape. This observation, along with the alternate MMSE SINR expression of 
Section IV-B, are the key elements in determining the relationship between the ALS and 
MMSE SINRs, outlined later in Section VI. 

. For exponential weighting with L < 00 (where L is the large-system window size defined 
in Section II-E), in Appendix V the e.d.f. of W is shown to converge in distribution to the 
fixed distribution 

L 



e~ v/L <w<l 



n 



which is the relevant distribution of W required in (46). Also, for z — 
W m 1 which are required can be evaluated using (47), and are given by 



L 

V 
1 



log 



1 + fi 



(47) 

Re(^), values of 

(48) 
(49) 



In fact, (37) is one of many possible expressions, which can be derived from the identity 
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^tr[RR x ] = 1. Other expressions involving 71 derived in this manner include 



0(1 + *7i) 



V{1 -W ,i) 



(50) 



P{l-*fi/h) 



(51) 



a(l -£ ,i) : 
/3*(1- ^0,1) 



api(^i -iy)f lf i 
—V\{j\ — aptv)P*H l! i 



(52) 



(53) 



Note the similarity to the expressions (22)-(23), derived in a similar manner for the MMSE 
receiver. These expressions, derived in Appendix VI, are used in the proof of Theorem 3. 
• Note that £ m ,i, 'Hm,i, an d W m ,i are all of the form X m ^ = E [X m /(1 + xX)], for which 
the following simple and useful identity holds. 



for m £ Z*. Observe also that the last equality in each of (22), (23), (52) and (53) follows 
from the identity (54), as does (49). This identity also relates (50) to (51), although in a 
less obvious way. 

• The set of equations in Theorem 3 can be solved numerically in a similar manner to that 
discussed for the MMSE case after Theorem 1 . Here, however, it is advantageous to consider 
(38) and (41) as a two-dimensional equation in the variables 8 p\ and f\. During zero finding, 
given these values, the remaining variables 71, &\, fi, ipi, z> 1? W^i, £1,1, and TC^i can be 
directly calculated. 

• Similar to [2, Section 3], it is possible that through a process of truncation and centralization, 
the condition that the moments of S and B are bounded when either matrix is i.i.d. may 
be removed. 

Using Theorem 3, we may now calculate pi and which are asymptotically equivalent to 
Pi and ipi, two of the quantities required to compute the SINR in Theorem 2. 

The following Lemma gives expressions, which may be solved for quantities asymptotically 
equivalent to pj 1 and ip^ £ C + , j = 2,4, and which occur in (31) of Theorem 2. This Lemma 
introduces more auxiliary random variables in C + , namely, yF, p^, ^ , and Cjf , f^, j = 2, 3, 4, 
plus and , j = 2, 3, which are defined in terms of matrix traces and quadratic forms in 
Appendix VI-A. 

8 That is, unless z —* is being considered (i.e., no diagonal loading), in which case it is necessary to instead consider pi 
and £>i as the search variables, since f\ depends only on j3, ?y, and Wo,i due to (50). 



E[X m ] = E X 



in 



1+xX 
1 + xX 




(54) 
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Lemma 1: In addition to the assumptions and definitions of Theorem 3, under the limit spec- 



0. 



0, and f 



N 



-» 0, \ujj — ujj | 

0, j = 2, 3, where jj, pj, ipj, ujj, and fj, 



ified, \ fy - 7j| — > 0, |pj v - 
0, j = 2,3,4, and |rf -r,-| — 
j = 2, 3, 4, and r, and j = 2, 3, are solutions to the following equations. For i.i.d. S, 

V(Wi,2 + N 2 T 2 7i: 2 , 2 ) , j = 2, 

/5*(Wi,2 + |AL| 2 (l + r3)W 2| 2) , J = 3, 



Pi 



a(P3^i,2 + ^3 |pi| 2 £ 



2,2 J 



fj = a C0j£i j2 + 



w — Ipi 



, J=4, 
J = 2,3, 



and for isometric S, 



Pi 



/3*(i> 2 Wi, 2 + f 2 |i>i| W 2)2 ) - cr0 2 £ 1>2 , j = 2, 

/9*(^i,2 + (l + f 3 )|z>i| 2 7i 2 , 2 )-a4|pi| 2 42 , J = 3, 

«(P3^1,2 + ^3 |pl| 2 4,2 - (1 +^4) |pl| 2 ^l,2) , j=4, 

"(42 - 1) + 1 



where for i.i.d. B, 



(55) 



(56) 



(57) 



1 a(i) 2 £i,2 + 


w — ipi 


2 


- /3*i> 2 apw - fi 2 7Yi, 2 


, J = 2, 1 




w — ^1 


2 

P3^2,2) 


- /3* apw - fi| 2 (z> 3 7^i,2 + |i>i| 2 ^2,2) 


, J = 3,j 


/3*(Wo,2 - 1) + 1 



(58) 



CO i 



f(p 4 42+|pl| 2 (1+^4)4,2) 



while for orthogonal B 



a 



3 = 2,3, 
J = 4, 




f 4 W 2i2 - ^-\W - V>l| 2 (p 4 ^l,2 + |pi| 2 <^2,2) , 
7]* 

^,2-l) + l 

~(Pj£l,2 + |pl| 2 ^'^2,2) - 0" 2 4^' 1^1 1 2 >V 2 ,2 

-(p 4 42 + |pil 2 (i + ^4)4,2) -^ 2 4^l^il 2 >v 2 , 2 , 7 = 4, 

^(|^i| 2 W 2 , 2 -2Re(w 1 >V 1 , 1 )) + l 



J = 2,3, 



(59) 
(60) 



(61) 



(62) 
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Also, 



20 

ujj + a -7j , j = 1,...,4, 
rj 



^-*(|r 2 Wi )2 -7i) 



(z>2^1,2 + \V\ | 2 f 2 H 2 ,2)) 



o(Pa£l,2 + ^2 |pi| £2,2) 

|Pi| 2 (l + a 2 f 2 >V2,2) , 

(T 2 |t>l| 2 f 3 W2,2 , 



2. 
3. 
i. 



(63) 



(64) 



2. 
3. 



(65) 



and 



m,2 



W m , 2 



E 
E 



Ppi(^i - w) 



_|1 — Hviifi — apw)\ 
W m 



(66) 

(67) 
(68) 



where 71, p\, f\, tpi, u\, i>\, f\, £ m , n , 7~L m ,n, and W m , n are determined by Theorem 3, and again 
z -> — Afc/?7. 

Proof: The proof of Lemma 1 follows the same approach as the derivation of Theorem 3. 
That is, expressions for pj, fj, 'i/jj, Uj and Vj for j = 2,3,4 are derived in the same manner as 
the expressions for p x , t x , ipi, £>i and V\ in Theorem 3, respectively. Also, as the expression for 
71 in Theorem 3 is derived from the identity RRr 1 = 1 M , so the expression for 72 in (64) is 
derived from the identity RtRrtR -1 = R -1 . A full derivation can be found in [25]. ■ 
Remarks: 

• For exponential weighting with L < 00, as z — > Re(z), the required expressions for W m> 2 
can be evaluated using (47), and are given by 



m,2 

W 2 ,2 



W 



(1 +fi)(l +e-^l L fi) 

-(Wy - Wi l2 ) 



(69) 
(70) 



where w 



'1 - e""/^ 
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• Note that E m> 2, H m ,2, and W m ,2 are all of the form X m>2 = E [X m / |1 + xX| 2 ], for which 

the following simple and useful identity holds. 

_ \ (l + x*X)X^ 
[ \l + xX\ J 

for m E Z*. This identity can be used to simplify the calculation of certain terms in Lemma 
1, and also gives (70). 

• These equations can be solved numerically using three zero-finding routines, two of which 
are for two variables, while the third is for one variable. Specifically, 

1) First solve the subset of equations given by (63), (64), (65), and depending on the type 
of S, (55) and (56), or (57) and (58), and depending on the type of B, (59) and (60), 
or (61) and (62), to find f 2 , 72, v 2 , p 2 , f 2 , ip 2 , and Cj 2 . This can be done numerically 
using a zero-finding routine for the two variables f 2 and p 2 . 

2) Solve the subset of equations given by (63), (64), (65), and depending on the type of 
S, (55) and (56), or (57) and (58), and depending on the type of B, (59) and (60), 
or (61) and (62), to find f 3 , 73, z> 3 , p 3 , T3, ^3, and c2> 3 . This can be done numerically 
using a zero-finding routine for the two variables cj 3 and p$. 

3) Now solve the subset of equations given by, (63), (64), and depending on the type of 
S, (55) or (57), and depending of the type of B, (59) and (60), or (61) and (62), to 
find f 4 , 74, p 4 , ^4, and 0)4. This can be done numerically with a zero-finding routine 
for one variable, namely ^4. 

• The solution (64) is one of many possible expressions, which can be derived from the 
identity -^trfRRr 2 ] = -gtrfR" 1 ]. All possible expressions involving 72 derived in this 
manner include 

7i + ^*72 = |r 2 Wi, 2 (72) 
- l\v 2 = -piwi - ™)*P2 + Pii>2) \h\ £1,2 (73) 
= ~ oipw)*v 2 + z>if 2 ) |z>i| 2 Wi )2 (74) 

When solving the set of equations in Lemma 1, both of these expressions are more useful 
than (64) when considering z — > (i.e. no diagonal loading). 

B. Steady-State ALS SINR 

We now determine the steady-state ALS SINR, that is, the SINR as the number of training 
intervals r\ — > 00 (either with a training sequence, or semi-blind), from the transient ALS SINR 
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expressions in Section V-A. Of course, if there is no windowing (i.e., W = I;), and diagonal 
loading pi/r) for any fi > 0, then the output SINR converges to that of an MMSE receiver with 
full CSI. We first verify this result, and then turn to the more interesting case of data windowing 
and (optionally) diagonal loading. We will see that the steady-state response is the same for both 
i.i.d. and orthogonal training sequences, which matches intuition, since i.i.d. training sequences 
become orthogonal as 7] — > oo. 

An approximate analysis of the steady-state performance of the ALS receiver with exponential 
windowing was presented in [9] for DS-CDMA with flat fading. The large-system steady-state 
ALS performance is considered in [16, Corollary 2]. In [16], results from asymptotic analysis 
of reduced rank filters are used, which rely on arguments related to non-crossing partitions. 
Here we give a more direct derivation of the large-system steady-state performance of the ALS 
receiver for the general transmission model (1). 

Strictly speaking, Theorems 1 and 3 require z £ C + , however for the following discussion 
we shall implicitly consider z — > —a 2 and z — > —fifr], respectively. 

1) No Windowing: We first consider the limit of the equations in Theorem 3 as 7] — > oo, and 
show that without windowing (i.e., W = L) the ALS SINR converges to the MMSE SINR. 

First note that a)i — > for both i.i.d. and orthogonal B, which means W m>n — > 1, and therefore 
— > and i>i — > 1/cr 2 . Moreover, E m>n — > £ m ri and / H m , n — > 7i m , n from Theorem 1. We see 
that the expressions for the ALS moments 71, pi and t\ from Theorem 3 converge to the MMSE 
moments 71, p 1? and T\, respectively, in Theorem 1 at z — > —a 2 as 77 — ► 00. 

Now consider the limit of the equations in Lemma 1 with no windowing as 77 — > 00. Clearly, 
0J2, W3, and cj 4 all — > 0, and hence also ^2, ^3, and ^4 also — > for either i.i.d. or orthogonal 
B. It follows that z> 2 (V " 2 ) 2 and h ~^ °- 

Substituting the preceding limits into (55)-(58), we see immediately that as rj — > 00, the 
variables p 2 , P3, P4, f 2 , and f 3 satisfy the same set of equations as p 2 , pz, Pa, t 2 , and r 3 for 
the MMSE receiver, which appear in the SINR expression in Section IV-B, given by (25)-(28). 
Therefore, p 4 + a 2 p 2 — >• p* as 77 — > 00, and therefore the ALS SINR converges to the MMSE 
SINR as 77 — >• 00. The diagonal loading constant pi/r) disappears in this limit. 

2) Fixed-Length Data Windowing: The following results apply to fixed-length windowing 
functions. That is, the large-system window size does scale with r]. 9 For example, in the case 

9 Of course, the actual window size should increase with iV in order to define a meaningful large system limit, as explained 
in Section II-E. Here we are referring to the large-system window size after the large-system limit has been determined. With 
exponential windowing this is the difference between L and L. 
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of exponential windowing this corresponds to fixed L. More precisely, we define fixed-length 
windowing as 

lim ??E[WJ > (75) 

71— *00 

, w < 

lim F Wri (w) = { (76) 
, w > 



77— >00 



where, for given 77, W v denotes a scalar r.v. with distribution F Wr] (w), given by the (compactly 
supported) a.e.d. of W. In other words, F w converges in distribution to a delta-distribution at 
zero as 77 — > 00, with the mean of W v of order 77 -1 . For example, for exponential windowing, 
the mean window size is w = ^(1 — e~ v ^ L ). 

We have the following corollary to Theorem 2, Theorem 3, and Lemma 1, which specifies 
the steady-state ALS SINR with fixed-length windowing. 

Corollary 1: Under the limit specified in Theorem 2, and also as 77 — > 00, provided all 
asymptotic moments exist, the asymptotic steady-state SINR for the ALS receiver for stream k 
with fixed-length windowing is given by the asymptotic ALS SINR specified in Theorems 2, 3, 
and Lemma 1, where 

• the i.i.d. B relations are used for both i.i.d. and isometric B, 

• z is replaced by — fi, 

• Tj is replaced by one, and, 

• the variables w and W mi „ are replaced by w and VV m , n , where 

w = lim r]w (77) 

77-^00 

W m . n = lim V W m , n (78) 

77— *oo 

Proof: See Appendix VIII. ■ 
Remark: 

• With exponential windowing, w = L, and from (78) we have 

m^^Z^ (79) 
m,2 = — L — (so) 

1 + T\ 

W 2 , 2 = ^ - Wi, 2 ) (81) 



n 

where f\ = exp(^(l — /X71)) — 1, and we have used (50) to obtain (79). 
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VI. Relationship Between MMSE and ALS Receivers 

In this section we present a simple relationship between the SINRs of the MMSE and ALS 
receivers given in the previous sections. We note that this relationship has recently been studied 
in the special case H = Lv, i.i.d. S, i.i.d. training, and no diagonal loading. An approximate 
relation was given for rectangular windowing in [17], with a corresponding exact large system 
expression given in [16, Corollary 1]. Also, [9] obtained approximate expressions for the steady- 
state SINR relationship (r) — > 00) with exponential windowing. 

A. Transient Response 

The following theorem applies to any H, both i.i.d. and isometric S, and any windowing 
shape. The only restrictions are that there is i.i.d. training and no diagonal loading (i.e., /i = 0). 
The theorem relates the expressions in Theorem 3 and Lemma 1 to the alternate MMSE SINR 
expression of Section IV-B. 

Theorem 4: For the k th data stream, the asymptotic SINR of the full-CSI MMSE receiver 
SINRf MSE is related to the asymptotic SINR of the ALS receiver with i.i.d. training sequences, 
data windowing, and no diagonal loading, SINR£ LS , according to 

(jTvrpMMSE 

SiNRf s = MiNKfc ; - (82) 



with training sequences, and 



SINR MMSE 



ctivjrMMSE 

SINRf s = , , ^ (83) 



C + (C - l)SINRf MSE 
for semi-blind training, with either i.i.d. or isometric S, where 

which depends only on rj, (3, and the window shape. 

Proof: See Appendix IX. ■ 
Remarks: 

• To calculate £, note that from (50) we have Wo,i = 1 — - for any window shape. Since 
Wo,i is a fixed known function of fx, we can invert this equation to find f\. For example, 
with exponential windowing, we obtain from (48) that 
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and with rectangular windowing, fx = (3/(r} — /3). Given fx, we can directly calculate W m , n 
from the definition given in (46), and ( from (84). The point here is that f 1? W m . m and ( 
are essentially constants, depending only on (3, r], and the window shape. 

• Remarkably, Theorem 4 implies that SINR^ LS only depends on (3, 77, the window shape, 
and SINR^ MSE . That is, the convergence rate of the ALS SINR to the steady-state value is 
independent of the channel (of course, the steady-state value itself depends on the channel). 
Stating this another way, for a particular (3 and window shape, all channels, which have 
the same MMSE SINR, have an identical transient ALS SINR response. 'Channel' here 
refers to the product HSA. This has been observed in [9, 11], although a transient SINR 
relationship, such as that given in Theorem 4, has not previously been determined. 

. Recall that due to Proposition 1, Theorem 4 also holds for the general AWGN model 
r = Cb + n for which b is unitarily invariant and the eigenvalues of C* are well behaved. 
In that sense Theorem 4 is a fundamental property of linear estimation. 

. In fact, our assumption that the additive noise n m is i.i.d. complex Gaussian distributed is 
an unnecessary restriction, as all results presented hold for any distribution such that n m is 
unitarily invariant, and the elements of n m are i.i.d. with zero mean and variance a 2 . 

• With exponential windowing we have 

c = ^ - e ~ v/L) 

and for rectangular windowing, we have C, = I + (3 / — (3). With rectangular windowing 
and (3=1 (i.e., square H) this matches the expression derived in [16] for DS-CDMA with 
i.i.d. signatures in flat fading. 
Unfortunately, we do not have a compact expression, analogous to Theorem 4, which relates 
the ALS and MMSE SINRs with orthogonal training sequences and/or diagonal loading, although 
it seems likely that such a relationship exists. What we can say is that with orthogonal training 
sequences and no diagonal loading the moments of Theorem 1 and Theorem 3 are related via 

A = ^P. (87) 

7i = J-7i (88) 
yy 1,1 

Wi i 

fx = (ti — ap) + awp (89) 

where Db = 1 — Interestingly, these relationships depend on the channel through Db 

(which was not the case with i.i.d. training). Finding a corresponding relation for the SINRs 
with orthogonal training remains an open problem. 
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Fig. 1. C, vs. L for r\ = oo and a range of /3, from (84). 



5. Steady-State Response 

For the steady- state response (rj 
with 



oo) with fixed-length data windowing, Theorem 4 holds 



c 



(90) 



where W n 



lim^^oo r\W mn . This result is proved simply by letting 7] — > oo in Theorem 4. 



Of course, this steady-state relationship also holds for orthogonal training sequences. 



With exponential weighting, we have from (86) that ( 
as we increase the window size) £ 
expected. 



/3 



L(l-e-f/£) 

1 (using L'Hopital's rule), and SINR 



. Note that as L — > oo (i.e., 



ALS 

k 



sinr 1 : 



MMSE 



as 



In [9], similar approximate relationships were derived for the steady state performance of the 
ALS receiver with exponential windowing for DS-CDMA in fiat fading. The equivalent value 
of ( there is (' = ^r{N — 1), which converges to (' — > ~ + 1 as N — ► oo after substituting 
e = 1 — j~. Fig. 1 shows a plot of this approximation, which is quite accurate when compared 
to the exact large-system value at {3 — 1, particularly for large L. 



Submitted to IEEE Trans. Info. Theory - Feb. 05, Revised Aug. 05 



28 



C. Capacity Relationship 

Consider the difference in capacity per data stream 10 of the MMSE and the ALS receivers, 




C 

with training sequences. 

As C, — > 1, the capacity difference approaches zero, whereas as £ — > oo, the difference 
approaches In (l + SINRf MSE ). Recall that ( only depends on 7] (the ratio of training symbols 
to transmit dimensions), j3 (the ratio of receive to transmit dimensions), and the window shape, 
and does not depend on the SNR, the (normalized) number of data streams a, or the channel 
distribution. Nor does this value depend on whether S is i.i.d. or isometric. Fig. 1 shows the 
steady-state value of £ vs. L with exponential windowing and a range of (5 values. 

VII. Numerical Studies 

We now present various applications of the results presented in previous sections. We shall 
focus on three example systems: 

• The first is the standard model of a MIMO channel with rich scattering, for which we 
set H = Ijv and S i.i.d., so that K and N represent the number of transmit and receive 
antennas, respectively. 

• The second example system is CDMA in frequency- selective Rayleigh fading, for which 
S contains either i.i.d. or isometric signatures, and H is a square N x N matrix (hence 
(3 = 1), for which the a.e.d. of the channel correlation matrix H* is exponential with mean 
one (i.e., fn(h) = exp(-h) for h > is the density used to compute the H m , n values). 

• The third example system is a SISO FIR channel with a cyclic prefix, as described after 
Proposition 1 in Section II-E, where h = [0.227, 0.46, 0.688, 0.46, 0.227] t (i.e., where 
the ALS and MMSE receiver is used to equalize the so-called Proakis Channel-C [26, pp. 
616]). That is, the empirical results will be obtained using H given by the circulant matrix 

10 That is, we are assuming each data stream is independently coded and decoded. Also, we are assuming that the residual 
multi-access interference (MAI) is Gaussian, which one would expect to be valid in the asymptotic limit considered due to the 
central limit theorem. 
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obtained from h, and S = A = Ijy. As described in Proposition 1, the analytic results 
are obtained from the isometric S equations with a — 1, and H distributed according to 
the spectra of h, i.e., Fu{h) = 4p 2~2n=i u (h ~~ 2( n ))> where u(t) is the step function, and 
Q(n) = |DFT^(h)| 2 , where DFT^(h) denotes the n th element of the iV-point discrete 
Fourier transform of h. 

Unless otherwise stated, we shall assume equal transmit power per data stream (i.e., A = I K ), 
and SNR= 10 dB, where SNR is defined as the energy transmitted per data stream in each 
symbol interval, divided by a 2 . 

In the following plots we determine empirical values from averages of a size N = 32 system 
with QPSK modulation for comparison with the large system results. The asymptotic values for 
the MMSE curves have been determined from Theorem 1, and the asymptotic values for the 
ALS curves have been determined from Theorem 2, Theorem 3, and Lemma 1. The steady-state 
values of the ALS receiver have been determined from Corollary 1. Where possible, the ALS 
SINR has been determined from the MMSE SINR using Theorem 4 (i.e., any situation with 
i.i.d. training sequences and no diagonal loading). 

A. Transient ALS SINR response and comparison with empirical values 

1 ) MIMO example: Firstly, we demonstrate the relevance of the large-system limit to practical 
finite systems. Fig. 2 shows both asymptotic and empirical values of MMSE and ALS SINR vs. 
training length for the example MIMO system with rich scattering. For the ALS receiver, the 
diagonal loading value is p, — 0.1, and rectangular windowing is used. Clearly, the empirical 
(finite) values match the analytic (asymptotic) values very closely. 

Note that the orthogonal training sequences clearly outperform the i.i.d. training sequences, 
particularly for 'small' rj. This gap also widens as the number of receive dimensions decreases. 
Also, the performance of the semi-blind ALS receiver is comparable to the performance of 
the ALS receiver with training for the 2 to 1 transmit to receive antennas ratio case, but is 
significantly worse in the 1 to 2 transmit to receive antennas ratio case. 

2) CDMA in frequency-selective fading: Fig. 3 shows empirical and asymptotic values of 
MMSE and ALS SINR vs. r] for the example CDMA system in frequency- selective Rayleigh 
fading with a = 0.50. The ALS receiver uses rectangular windowing and a diagonal loading 
constant ji = 0.1. Curves are shown for both i.i.d. and isometric signatures, and i.i.d. and 
orthogonal training sequences. Again, the empirical (finite) values match the analytic (asymptotic) 
values. 
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1/2 transmit/receive antennas 




MMSE - MIMO (empirical N=32) 
MMSE - MIMO (analytic) 

ALS - MIMO & orthog. training seqs (empirical N=32) 
ALS - MIMO & orthog. training seqs (analytic) 
ALS - MIMO & i.i.d. training seqs (empirical N=32) 
ALS - MIMO & i.i.d. training seqs (analytic) 
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MMSE - MIMO (empirical N=32) 
MMSE - MIMO (analytic) 

ALS - MIMO & orthog. training seqs (empirical N=32) 
ALS - MIMO & orthog. training seqs (analytic) 
ALS - MIMO & i.i.d. training seqs (empirical N=32) 
ALS - MIMO & i.i.d. training seqs (analytic) 
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1 1 



(a) ALS with training (b) semi-blind ALS 

Fig. 2. SINR vs. training length: (Rich MIMO) equal transmit power per antenna, SNR= lOdB, /u = 0.1, rectangular windowing. 
Comparison with empirical values N = 32, QPSK modulation. 



Figure 3(b) shows the intuitively pleasing result that for a small number of training symbols 
(i.e., small rj), orthogonal training sequences improve the performance of the ALS receiver more 
than isometric signatures, and as 77 increases, this situation is quickly reversed. This is due to 
the fact that the K i.i.d. training sequences of length % become 'more orthogonal' as i increases, 
and also since isometric signatures consistently outperform i.i.d. signatures. 

In subsequent plots, we shall omit the empirical values, and concentrate on applications of 
the analytical results. 

3) Equalization: Fig. 4 shows empirical and asymptotic values of MMSE and ALS SINR 
vs. 77 for the example SISO FIR system at 20dB SNR. The ALS receiver uses exponential 
windowing and a diagonal loading constant fi = 0.1. Curves are shown for both i.i.d. and 
orthogonal training sequences. Note that Proposition 1 requires that b m is unitarily invariant, 
whereas the empirical values in the figure are based on standard QPSK modulation (i.e., b m 
is not unitarily invariant). Clearly, at least in this case, the asymptotic results are a very good 
approximation for non-unitarily invariant data vectors. 

B. Capacity with exponential windowing 

Now we examine the performance of the ALS receiver with both rectangular and exponential 
windowing, relative to the MMSE receiver. Fig. 5 shows the capacity difference per-signature 
as a function of the window size L (determined from (91) and (92)) for the example CDMA 
system in frequency- selective Rayleigh fading with i.i.d. signatures, i.i.d. training, and a system 
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MMSE MMSE 




(a) ALS with training (b) semi-blind ALS 

Fig. 3. SINR vs. training length: CDMA in frequency-selective fading, SNR=10dB, a = 0.50, /j, = 0.1, rectangular windowing, 
equal power per signature, exponential distribution for a.e.d. of H . Comparison with empirical values N = 32, QPSK 
modulation. 

load of a = 0.75. Curves for the ALS receiver are shown with both rectangular and exponential 
windowing, and diagonal loading constant /i = 0. Also, fp(p) = \5(p — 1) + \5(p — |), that is, 
one quarter of the signatures are transmitted at half the power of the remaining signatures. 

In this figure, we do not take into account the loss in rate due to the training. This is considered 
in the following subsection. Rather, for a single channel use at a certain SNR, we wish to see the 
relative capacity difference between the MMSE receiver (using full CSI), and that obtained by 
the ALS receiver as a function of the number of training symbols used to generate the filter. Also, 
for the ALS receiver, we wish to compare exponential windowing with rectangular windowing 
at a given value of 7] as a function of the exponential windowing window size, L. 

Firstly, we see that for either type of windowing, increasing the number of training symbols is 
an exercise in diminishing returns. Also, we see that as the window size increases, exponential 
windowing asymptotes to rectangular windowing, as would be expected for the time-invariant 
system model (1). Of course, exponential windowing is included to allow for time- varying 
channels. As such, the curves for exponential windowing are a valid approximation for a time- 
varying system in which the coherence time of the system 11 is at least as large as the effective 
window size created by the exponential windowing. As such, the values of capacity or SINR 
obtained represent best possible values, which are only attained if the system remains static for 

"'Coherence time' here refers to the number of symbols over which HSA and a 2 remain approximately constant. 



Submitted to IEEE Trans. Info. Theory - Feb. 05, Revised Aug. 05 



32 




T| — > ™ asymptote (analytic) 
ALS: orthog. training seqs (analytic) 
ALS: i.i.d. training seqs (analytic) 
MMSE (empirical, N=32) 
ALS: L = ™ (empirical, N=32) 
ALS: L= 10 (empirical, N=32) 
ALS: L = 5 (empirical, N=32) 
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Fig. 4. SINR vs. training length: Equalization of Proakis C-channel (i.e., H circulant matrix constructed from impulse 
response, and S = A = Ijv)at SNR=20dB. The ALS receiver uses fi = 0.1, exponential windowing, with training using i.i.d. 
and orthogonal training sequences. Comparison with empirical values N — 32, QPSK modulation. Analytic values are obtained 
using isometric S equations with a — [3 — 1, as specified by Proposition 1. 



the duration of the ALS training period. Extending these results to time-varying systems is an 
open problem, and is likely to be difficult. 

C. Application: Throughput Optimization 

We now demonstrate how the results can be used to optimize the throughput with packet 
transmissions. More training symbols gives a higher ALS SINR, but leaves less room for data- 
carrying symbols in the packet. Clearly, there is an optimal ratio of training symbols to data- 
carrying symbols. Such an optimization has been considered for MIMO block fading channels 
and SISO FIR channels in [27, 28] with an optimal (maximum-likelihood) receiver. In that work, 
the training symbols are used to estimate the channel directly. A lower bound on the capacity 
is derived, and is used to optimize the training length. Related work in [29] applies the large- 
system transient analysis in [16] for the MIMO i.i.d. channel to optimize the training length 
with an ALS receiver (without exponential windowing or diagonal loading). It is shown there 
that for large packet lengths (£) the training length that maximizes capacity grows as O(v^). 
Optimization of power levels between the training and data symbols is also investigated. 

Suppose we consider a packet containing T > i symbols, of which the first i are train- 
ing symbols, and the remainder consists of data-carrying symbols. There are K equal power 
data streams, which are coded independently with capacity-achieving 12 codes with rate R c = 
log 2 (l + SINR ALS ). We focus on the ALS receiver with known training symbols. The number 

12 Here we assume that the residual interference at the receiver output is i.i.d. circularly symmetric complex Gaussian. 
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(a) ALS with training (b) semi-blind ALS 

Fig. 5. Capacity difference between ALS and MMSE receivers vs. window size of exponential window: CDMA in frequency- 
selective fading, i.i.d. signatures, i.i.d. training, SNR=10dB, a = 0.75, n — 0, rectangular and exponential windowing. Note the 
window size for rectangular windowing is rj. Also, fp(p) = §<5(p — 1) + \5{p — |). Curves shown correspond to the first 3/4 
of the signatures. Note that the scale of the vertical axis of Fig. 5(b) is twice that of Fig. 5(a). 



of information bits per block is therefore KR C (T — i), while the number of transmit dimensions 
per block is NT. Therefore, the number of information bits per transmit dimension (hereafter 
referred to as 'normalized capacity') is C = aR e ff, where i? e ff = -R c (l — v/fy an d I = T/N. 
We shall consider the additional limit T — > oo with T/N — > £ > in order to optimize C with 
respect to the normalized training length r). We shall keep Eb/a 2 = SNR/i? e g constant, and 
unless otherwise stated, in the numerical examples E b /a 2 = 10 dB. 

Fig. 6(a) shows the normalized capacity of the example CDMA system in frequency-selective 
fading as a function of r] and a for a normalized block length of £ = T/N = 15. The ALS 
receiver uses rectangular windowing and no diagonal loading. Fig. 6(b) shows the additional 
normalized capacity obtained, relative to the results for i.i.d. training in Fig. 6(a), if orthogonal 
training sequences are used. Although not shown, plots analogous to Fig. 6 may also be obtained 
for isometric S. 

Fig. 6(a) shows that there is an optimum value of r\/i, i.e., the ratio of training length to block 
length, for each value of system load, a. Fig. 7(a) shows the value of normalized capacity at 
the optimum value of 77, again for t = 15, as a function of the system load a. Fig. 7(b) shows 
the corresponding value of ?7 (expressed as a percentage of i) which maximizes the normalized 
capacity. Also shown in Fig. 7(a) is the normalized capacity of the MMSE receiver at the same 
value of E b /a 2 , for both types of signatures. Of course, the MMSE receiver assumes perfect 
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(a) i.i.d. training signatures (b) Additional normalized capacity obtained using 

orthogonal training signatures 

Fig. 6. Throughput optimization: CDMA in frequency-selective fading, i.i.d. S, normalized block length T/N = 15, rectangular 
windowing, no diagonal loading, —£=10 dB. Fig. 6(b) shows the additional normalized capacity (with respect to Fig. 6(a)) 
obtained if orthogonal training sequences are used. 



CSI. 

We now consider throughput optimization for the example MIMO system, and consider the 
growth in normalized capacity with respect to the normalized block length, I. In this example, 
since K represents the number of transmit dimensions, the number of transmit dimensions per 
block is KT, and hence the number of information bits per transmit dimension is C = R e n- 
Figure 8(a) shows the growth in normalized capacity, optimized with respect tor)/£. These results 
show that the gain in using orthogonal training sequences appears to be more pronounced in 
situations where there is a high ratio of transmit antennas to receive antennas (i.e., a > 1). 
Figure 8(b) shows the associated optimal training length rj, expressed as a percentage of the 
block length I, for i.i.d. training sequences. 

Figure 9 shows the optimal value of 77 for orthogonal training sequences corresponding to the 
curves in Figure 8(a). 

It is interesting to note the case a = 4, where we see that 77 is never chosen less than 4. Recall 
that for a > 77, we have orthogonal rows of B and for a < 77, we have orthogonal columns 
of B. Clearly, orthogonal columns are preferable. If the axis were extended, we would see the 
same behavior in the other curves. 



Submitted to IEEE Trans. Info. Theory - Feb. 05, Revised Aug. 05 



35 




0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 0.3 0.4 0.5 0.6 0.7 0.8 0.9 I 



(a) Optimal Throughput (b) Optimal Training Length 

Fig. 7. Throughput optimization: CDMA in frequency-selective fading, normalized block length T/N = 15, rectangular 
windowing, no diagonal loading, — f=10 dB, equal power per signature, exponential distribution for a.e.d. of H*. 

VIII. Conclusions 

Determining the transient behavior of ALS algorithms with random inputs is a classical 
problem, which is relevant to many communications applications, such as equalization and 
interference suppression. The large system results presented here are the first set of exact results, 
which characterize the transient performance of ALS algorithms for a wide variety of channel 
models of interest. Namely, our results apply to any linear input-output model (see Proposition 1), 
where the input is unitarily invariant, and the channel matrix has a well-defined a.e.d. with finite 
moments. As such, these results can be used to evaluate adaptive equalizer performance in the 
context of space-time channels. This represents a significant generalization of the previous large 
system results in [16], which apply only to an i.i.d. channel matrix. Furthermore, the analytical 
approach relies only on elementary matrix manipulations, and is general enough to allow for 
orthonormal spreading and/or training sequences, in addition to i.i.d. sequences. Numerical results 
were presented, which show that orthogonal training sequences can perform significantly better 
than i.i.d. training sequences. 

For the general ALS algorithm and model considered, the output SINR can be expressed 
as the solution to a set of nonlinear equations. These equations are complicated by the fact 
that they depend on a number of auxiliary variables, each of which is a particular large matrix 
moment involving the sample covariance matrix. Still, it is relatively straightforward to solve 
these equations numerically. Illustrative examples were presented showing the effect of training 
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(a) Throughput (b) Optimal percentage of training in a block (i.i.d. 

training) 

Fig. 8. Throughput optimization: MIMO channel, rectangular windowing, no diagonal loading, §i=l0 dB, equal transmit 
power per antenna. 



length on the capacity of a block fading channel. 

In the case of i.i.d. sequences without diagonal loading, the set of equations for output SINR 
yields a simple relationship between the SINRs for ALS and MMSE receivers, which accounts 
for an arbitrary data shaping window. This relation shows that ALS performance depends on 
the channel matrix only through the MMSE. In other words, ALS performance is independent 
of the channel shape given a target output MMSE. Whether or not an analogous relation holds 
with diagonal loading, orthogonal training and/or spreading sequences remains an open problem. 
Application of the analysis presented here to more general channel models (e.g., multi-user/multi- 
antenna) is also a topic for further study. 

Appendix I 
Precursor to Asymptotic Analysis 

Definition 1: Let {aiv}iv=i,... an d {&zv}jv=i,... denote a pair of infinite sequences of complex- 
valued random variables indexed by N. These sequences are defined to be asymptotically 
equivalent, denoted x b^, iff |ajv — &jv| as N — > oo, where denotes almost- 
sure convergence in the limit considered. □ 
Clearly x is an equivalence relation, transitivity being obtained through the triangle inequality. 
We shall additionally define asymptotic equivalence for sequences of N x 1 vectors and N x N 
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Fig. 9. Optimal value of normalized training length r\ for orthogonal training sequences corresponding to Figure 8(a). 



matrices in an identical manner as above, where the absolute value is replaced by the Euclidean 
vector norm and the associated induced spectral norm, respectively. 

Lemma 2: If a N x b N and x N x y N , and if |ajv|, \i/n\ and/or | £»^v | , \x N \ are almost surely 
uniformly bounded above 13 over N, then cl n xn x b^VN- Similarly, cln/xn x &tv / Un if |ojv| or 
|6jv| is uniformly bounded above over N, and at least one of inf^ \xn\ and infjv \i/n\ is positive 
almost surely. 

Proof: The fact that clnxn x buUN can be seen after writing a^x^ — b^y^ = a^x^ — 
b N y N + a N y N - a N y N and hence \a N x N - b N y N \ < \a N \ \x N - y N \ + \y N \ \a N - b N \. Alter- 
natively, we may add and subtract b N x N from a N x N — b N y N to obtain \a N x N — b N y N \ < 
\xn\ \cin — &/v| + |&jv| \%n — Dn\- The division property, cln/xn x b N /y N , can be shown in the 
same way 

a N _ bN 

xn Vn 

\ \%N\\yN\ ' \xn\ 

Suppose 5 = ini N \y N \. Given a realization for which \x N —yn\ — ► and 5 > 0, we may 
take N sufficiently large such that \x^ — Un\ < 5/2 and hence \x^\ > 5/2. Alternatively, for a 
realization for which \xn — Vn\ — * and 5' = mf N \xn\ > 0, for N sufficiently large we may 
show \y N \ > 5'/2. Using these facts, and the uniform upper bounds for |ajy| or \b N \ we obtain 
the result. ■ 

13 A sequence {ajv}jv=i... of complex-valued Nxl vectors or scalars is uniformly bounded above over N if sup N |ajv| < oo, 
or in the case of complex-valued N x N matrices, sup N ||ffljv|| < oo. 



< 



|a=jv||s/jv| \vn\ 
[&jy|]gjv-wyj I |ajv-bjy| 



(93) 
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Note that the multiplicative part of Lemma 2 holds for any mixture of matrices, vectors or 
scalars for which the dimensions of and x^ are such that cl n xn makes sense, due to the 
submultiplicative property of the spectral norm. The following definition and related results, 
however, are concerned with scalar complex sequences. 

Definition 2: Let {{ajv,n}n=i..jv}iv=i,... and {{&jv,n}n=i..jv}iv=i,... denote a pair of infinite 
sequences, indexed by N. The ./V th element is a complex-valued sequences of length N, indexed 
by n. These sequences are defined to be uniformly asymptotically equivalent, denoted ajv.n x 
b N)Tl , iff max n <Ar \a N:n - b NjU \ as iV -> oo. □ 
Also we define and b^^ (as defined in Definitions 1 and 2 above) as being uniformly 
asymptotically equivalent (denoted ajy x &jv,nX if o-N,n ^ bN, n where a N ^ n = a N for all n = 



Also, analogous to Lemma 2, we have 

Lemma 3: If a N>n x b N>n and x NjTl x y N>n , and if \a N>n \, \yN,n\ and/or \b N>n \, \x NjTl \ are 
almost surely uniformly bounded above over N and n, then a^x^n ^ b N)n y N ^ n . Similarly, 
o-N,n/xN,n ^ bN,n/yN,n if \a>N,n\ or |6jv,n| is almost surely uniformly bounded above over N and 
n, and at least one of mi N>n \xn,u\ and inf^n \yN,n\ is positive almost surely. 

Lemma 4: If a N , n x 6^,,,, then ^ ^f =i %,n x jf En=i 6 iV,n- 
Proof: This follows immediately from 



Lemma 5: For JV = 1, . . ., let Xjy = Myv — zljv, where Mjy is an N x N Hermitian matrix 
and z £ C + , and suppose u N £ C N . Denote u N = ujyX^ujv. If 



1,...,N. 




(94) 



b = inf |ujv| > , a.s. 



(95) 



JV 



B = sup||Xjv || < oo , a.s. 

N 



(96) 



Then 



Im(wiv) > lm(z) 



B 2 



, a.s. 



(97) 



and hence u N £ C + , almost surely. 

Proof: First, we note the inequality (from the proof of [30, Lemma 16.5]) 



xtY *x xtx 



(98) 
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for any x G C^, 0, and Hermitian positive definite N x N complex- valued matrix Y. 
Now, 

lm(u N ) = ^uj^X- 1 - X^u^v = Im^uj^XtxrW (99) 
and hence using (98) and a realization for which (95) and (96) holds we obtain 



1 



< 



uLx^Xujv 



< 



IXI 



(100) 



Im(ujv) Im^uj^XtX)- 1 ^ Im^^ux) 2 Im(z) \u N \ 2 
which, with (95), (96), and z G C + gives the result. ■ 

The following lemma is an asymptotic extension of the matrix inversion lemma, and is used 
extensively in the subsequent appendices to remove matrix dimensions as described in Section 
III. It is based on an approach in [16]. 

Lemma 6: Let Y N = X N + Vjviijy + u^v^ + cnUnu^, where v N , u N G C^, c N G K*, and 
Xat = Mjv — zIn, where M. N is an N x A" Hermitian matrix and z G C + . Denote 



— uLxJ^vtv 



V N 



UjyXjY^UjV 



Assume that as N — > oo, 



ejv — > 



(101) 
(102) 
(103) 

(104) 



and 



Then, 



as A" — > oo, and 



b = inf lujvl > , a.s., 

N 

B = supmax{||Xjv||, |vjv| , |ujv| 5 |ctv|} < oo , a.s. 

N 



X^ujv - u N v N ) 







1 - u N (v N - C N ) 
X]^(-v N u N + (1 + c N u N )v N ) 



1 - Un (f TV - Cat) 



5 = inf min{|l — u^vnI , |1 — un(vn — cn)\} > 



(105) 
(106) 

(107) 
(108) 

(109) 



almost surely, where 5 depends only on B, b, and 1111(2:). 
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Proof: First note that from Lemma 5, (105), and (106), that un € C + almost surely. 
We therefore consider a realization for which E C + and (104)-(106) holds, and take N 
sufficiently large such that |ejy| < 1/2 and hence 

|l + ejv|>l/2. (110) 

Due to the definition of X, 

||X -1 || < Im(jz) -1 . (Ill) 

Now note that sup N \ujy\ = B 2 lm(z)~ 1 < oo due to (106) and (111). Also, Im(— u^ 1 ) = 
lm(u N )/ \u N \ 2 and sup N \uj f 1 \ = Im('U/v)~ 1 - Additionally, note that lm(v N ) > 0, using an 
identical argument to (99). Using these facts we obtain 

l-^ivl < hvj < Kvj < \ u n\ 2 < { B 2 



1 - u N v N 



\-u N l +v N \ lm(-u N l + v N ) lm(-u N l ) ' Im^jy) 2 \lm(z)b j 

(112) 



In the same way, also using cjy € M*, we obtain an identical uniform upper bound on 1 1 — u N (v N - 
and hence obtain (109). 

In what follows, we will drop the dependence on N from u N , v N , c N , u N , v N , Xjy, and Y N 
to clarify the derivations. Define Xi and X 2 according to the following equations 

Y = X 2 + cuu t (113) 

X 2 = Xi + uv t (114) 

X 1 = X + vu t . (115) 

The matrix inversion lemma gives 

2 l + cutX^u V 
2 1 1 + vtX^u 

, -, x-Wx- 1 
x '= x -TT^w- (118) 

First consider Xj" 1 , and note from (118), (104), Lemma 2, (106), (110), and (111) that 

X^ 1 x X- 1 - X-WX- 1 (119) 

In fact, in the remainder of the proof, we shall repeatedly use (104), (106), (109), (110), and 
(111) in order to apply Lemma 2, without explicitly stating this, however, it should be clear 
from the context. 
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From (119) we obtain 



X^uxX^fu-Mv) 
X^v x X _1 v 
v f X^ x (v f - W)X" 

utXr 1 x utx- 1 



From this, we obtain 



(120) 
(121) 
(122) 
(123) 

(124) 
(125) 
(126) 
(127) 

Before we consider X 2 1 , we first analyze the denominator of the second term in (1 17). Firstly, 
due to (126), we may take N large enough such that [v^X^u + uv\ < 5/2, and hence with 
(109) we obtain |l + v+X^ul > 5/2. With this fact, we obtain (l + v+X^u) -1 x (1 



u f Xi 


l u 




u 


v f Xi 


x v 




V 


v f X^ 






—uv 


u f Xi 


V 








since 



1 1 




vtX^u + 


uv 




1 + vtX^u 1 - uv 




1 -+ 


- vtX^u 




1 - 


uv\ 



< — Iv+X^u + uul 
o z 1 

Now consider X^ 1 , for which from (117) and the preceding discussion we obtain 

X X^v 



1 + vtX^u 
vX^u 



1 — uv 



uv 



-vu + V 



Similarly, 



X 2 x u 



vtXo 1 



utX 2 ^ 



1 



1 — uv 

1 

1 — uv 

1 

1 — uv 



X _1 (u-«v) 
(-uu + + v + )X _1 



U' -MV 



t)x- x 



UV) \ 

(128) 

(129) 

(130) 
(131) 

(132) 
(133) 
(134) 
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and so, 



u T X 2 L v 




+ -»7- — 1 

v T X 2 x u 


rfX^u 




u 




1 — uv 


v+X^v 




V 




1 — uv 



vu 
1 — uv 



l + cu f Xr x vl 



1 — ii(t> — c) 



(135) 
(136) 
(137) 

(138) 



1 — uv 

Before considering Y -1 , we note that from (106), (109), (138), and similar arguments pre 
ceding (128) that |l + cu^X 2 " 1 u| has a positive uniform lower bound and 1/(1 + cu^X 2 x u) > 
(1 — uv)/(l — u(v — c)). 

Considering Y _1 using (116) and the preceding discussion, we obtain 



Y x u 



X^u 



1 + cutX^ x u 



1 — u(v — c) 



■X-Hu-uv). 



Similarly, 



1 + cutX-V 



XrV + c 



WW 



1 — u(v — c) 



1 

1 — u{y — c) 



X~ l (-vxi + (1 + uc)v) 



(139) 
(140) 

(141) 
(142) 
(143) 



Lemma 7: Let A^r be an N x N Hermitian matrix, and suppose A 
Using the definitions and assumptions of Lemma 6, additionally define 



Then, 



trfAjvY,- 11 



N 



as N — > oo. 



-(i) 

; iv 



U 7vX »/ AjyX J- V N 



= vjyX^AjvX^ujv 



,(2) 

-AT 



WAT — Uj V -Xj V 1 AjvXj V 1 Ujv 



tr[AjvX 



Wjv + (v N - c N )u N - e { x> - e 



-(2)' 
■N 



1 - U N (v N - Cat) 



SUPjy ||Ajv|| < oo. 

(144) 
(145) 
(146) 
(147) 

^> (148) 
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Proof: The proof continues from the proof of Lemma 6. Again, we drop the subscript N 
for convenience. We see from (104), (118), (110) and (144) that 

tr[AX^] x trfAX" 1 ] - e« 



while (117), (120), (122), (126), and (144)-(147) give 



tr[AX2 x ] x trfAX^ 1 ] - 
= tr[AX^] + 

Similarly, (132), (134) and (144)-(147) give 



mhX^AX-Hu- 



uv 



uv + vu 



1 — uv 

r(2) 



uve 



(i) 



uv 



u+X^AX^u 



and finally (116) and (138) yield 

tr[AY^] x trfAX^ x ] 



u' 



uv 



uv 



u 



i + u 2 v-u(e {1) + e (2) ) 



uv)' 



1 — uv 



-u+X^AX^u. 



(149) 

(150) 
(151) 

(152) 
(153) 

(154) 
■ 

and q G C N . 



1 — u[v — c 

Combining the above, we obtain (148). 

Lemma 8: [2, Lemma 2.6] Let z E C + , A and BiVxiV Hermitian, r G 
Then, 

|tr [((B - zl)- 1 - (B + rqqt - ^I)' 1 ) A] | < ^jL. (155) 
Lemma 9: [31, Lemma 1] Let Cn, be an N x iV complex- valued matrix with uniformly 
bounded spectral radius for all N, i.e., sup^ ||Cjv|| < oo, and y = [Xx, . . . , X^l \fN , where 
the XiS are i.i.d. complex random variables with mean zero, unit variance, and finite eighth 
moment. Then 

c 



E[|ytCy-tr[C]| 4 ]< 



N 2 



(156) 



where the constant c > does not depend on N, C, nor on the distribution of X{. 

Lemma 10: Let S be K < N columns of an N x N Haar distributed random matrix, and 
suppose s is a column of S. Let Xjv be an N x N complex- valued matrix, which is a non-trivial 
function of all columns of S except s, and B = swp N ||Xjy|| < oo. Then, 



5 tx 



1 



: tr[nx 



< 



c_ 

N 2 



(157) 



N — K 

where II = I N — (SS^ — ss^) and C is a deterministic finite constant which depends only on B 
and a = K/N. 
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Proof: This result is a straightforward extension of [7, Proposition 4]. ■ 
Throughout the subsequent derivations, we shall use the fact that since we have assumed that 
the e.d.f.'s of A 2 , HH^, and W converge in distribution almost surely to compactly supported 
non-random distributions on M*, we have [32] 

K 



lim ^J2i(Pk) = E[f(P)] (158) 

ft— »oo J\ ' 4 

k=l 

Jfe ^I>»> = E l f (f)l (159) 



n=l 



1 1 

lim T VfK) = E[f(W)] (160) 

i— >oo ^ ' J 
m=l 

almost surely, where d n is the n th singular value of H, and / : M* — > M* is any fixed bounded 
continuous function on the support of the a.e.d. of A 2 , the first (3*N eigenvalues of Htf, and 
W, respectively. 

Appendix II 
Proof of Theorem 1 

The analysis in these appendices is based on removing a single dimension from matrices 
and vectors, as described in Section III. The dimension removed will correspond to a particular 
data stream, transmit/receive dimension, or symbol interval. For example, in what follows, H tn 
represents the matrix R with the n th transmit dimension removed. The symbol t n is used in this 
case since the n th transmit dimension is removed. We will use d k when removing the k th data 
stream, and r m for removing the m th received symbol interval. 

k n 

We define x and x according to Definition 2 in Appendix I, where the maximum is over 
k < K and n < N, respectively, and the limit is as (M, N, K) — ► oo with K/N — > a > and 
M/N — >• P > constant, as described in Section II-E. 

A. Definitions 

Let R = (HSA)* — zIm, z E C + . The Stieltjes transform of the e.d.f. of the eigenvalues of 
(HSA)* is given by G%{z) = 7^, and the MMSE SINR in (5) is given by P k Pi, k , where 



7 



N 



1 



J 



— tr[X w ] , for j = 1. (161) 



M 



fffu = h^X^hfc , for j = 1, . . . , 4., < k < K (162) 
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where h k = Hs fc , and 

R 1 ,3 = 1, 

xw = 



R f R 1 , 3 = 2, 

(163) 

R tH^R 1 ,3 = 3, 



R-t(HSA)^R~ 1 ,j=4. 

V 

Furthermore, xjj is defined by removing the /c th data stream, < k < K, from X^'), by 
replacing R, S, and A by R dfe , S dk , and A dk , respectively, where 

R dk = (HS dfc A d J* - zl M , (164) 

S dk is S with the k th column removed, and A dk is A with the k th row and column removed. 
That is, R dk = R - P k h k h{. 

The following proposition shows that we may substitute H with an equivalent matrix, without 
lack of generality. This substitution is essential in the analysis which follows. 

Proposition 2: For the model (1), the distribution of both the Stieltjes transform of the e.e.d. 
of (HSA)* and the MMSE SINR are invariant to the substitution of VD for H, where V is an 
M x M Haar-distributed random unitary matrix, D is a M x N diagonal matrix containing the 
singular values of H. 

Proof: Let T be an independent M x M Haar-distributed random matrix. Now, note that 
the quantities of interest, namely 7^ and p^ k , are unchanged by the substitution of TH for H. 
That is, 

7 f = ^trfR- 1 ] = ^trfTTtR" 1 ] = ltr[((THSA)* - zIm)' 1 } (165) 

Writing THS = (TUj\/)D(Ijj v S), where UmDU^ is the singular value decomposition of H, 
the unitary invariance of T and S infers the result for the Stieltjes transform. A similar treatment 
of pf gives the result for the MMSE SINR. ■ 
Therefore, in the remainder of this appendix, we substitute H with VD everywhere. 14 We denote 
the n th column of V as v n , for < n < M, and define v n = for n > M. Define {d\, . . . , dp*^} 
as the diagonal elements of D, note that (3*N = min(M, N), and define d n = for n > (3*N . 
We can now define 

rf n = ut X|fu n , for j = 1, 2, 3, < n < N (166) 



14 



We stress that VD is an equivalent matrix, as defined in Proposition 2, as opposed to a decomposition of H. 
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where u n = H tn S in A 2 s n , and recall that s n denotes the n tb column of S^. Also, x£ denotes 
X^) with the effect of the n th transmit dimension removed, < n < j3*N, by replacing R, H, 
and S with R in , H in , and S tn , respectively, where 

R tn = (n tn S t ,M - zIm (167) 
Ht n = V tn B tn (168) 

and where V tn and S tn are V and S with their n th column and row removed, respectively, and 
D+ is D with both the n th column and row removed. 

Returning to (162) and (166), note that these quadratic forms are uniformly asymptotically 
equivalent to the following expressions, derived in Appendix III. These will be important in the 
subsequent analysis. 



r k N J £trpH*X«>] , i-i.d. S, 



Ptk * P 1 = < ( 169 ) 



i 



N—K 



tr[nHtX^')Hl , iso. S, 



„ I ^tr[(HSA»)*XW] ,i.i.d.S, 

T l,n~ T j = \ Ar ( 17 °) 



jf Ell Tj!n > S > 



where 



Also, note that 



n = i 7V -ss t . (171) 



s^HtHsfc x (3*E[H) > (172) 



from Lemma 9 or Lemma 10, the Borel-Cantelli lemma, and (159). The positivity of (172) is 
implied by (3* > and E[H] > 0. Note that E[H] > is implied by the assumption that the 
distribution of H has a compact support on R*, and does not have all mass at zero. 
In addition, letting c n = sJjA 2 s n and p — E[P], we have 

c n x ap (173) 

This is shown in a similar manner to (172) using (158), noting that for isometric S it requires s n 
written as E K u) n , where F, K = [I K , Or,n-k], and w\ is the n th row of the N x N Haar matrix 
from which S is taken, i.e., S = @E^. 

We now give several bounds on particular matrix and vector norms which are required in 
order to apply Lemmas 2 and 3 later. Firstly, the assumption that z E C + gives 

HR^II < Im(^)- 1 (174) 
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Secondly, the assumptions on H, S, and A outlined in Section II imply 

sup||H|| 2 <oo (175) 

N 

sup || A || 2 < oo (176) 

N 

||S|| 2 ^ (1 + v^) 2 , (i.i.d. S) (177) 

|s fc | 2 x 1 (178) 

|s n ,| 2 x a (179) 

where (177) is due to [33], while (178) and (179) are shown in an identical manner to (172) 
and (173), respectively. Of course, ||S|| = 1 for isometric S. Moreover, (174)-(179) imply 

sup maxmax{|u n | , |c n |} < oo , a.s. (180) 

sup max{ \pf , max \pj k } < oo , a.s. , for j = 1 . . . 4. (181) 



N 



k<K 



supmaxjlr^l ,max Ir/ll} < oo , a.s. , for j = 1 . . . 3. (182) 



A 1 



n<N 



and additionally, with the assumption that \z\ < oo, 

sup||R||<oo , a.s. (183) 

N 

B. Derivations 

We start by using the matrix inversion lemma to extract the A; th data stream from R, as 
described in Section III. 

This may be applied to the following identity to obtain 

1 1 - 

1 = ^trfRR- 1 ] = -zt» + -Y / P k hlR- 1 h k (185) 

k=l 

N air. PkPi, k 

= - Zl1 + PT<hTTp^ <186) 



Now, we show that 



PkPlk k P fcP f 



(187) 



1 + P kP l k 1 + P fcP f 

First note that (172) and (183) satisfies conditions (95) and (96), respectively, of Lemma 5, and 
hence Im(pf fe ) is uniformly bounded below over K = aN and k < K by some 5 > 0. Now 
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due to (169), we may consider a realization for which max/j<^ \Pi,k ~ Pi\ ~~ > holds, and take 
N sufficiently large such that max fc <^ lm(pf k — p±) < 5/2 so that Im(p^) > 5/2. Moreover, 
note that |l + P k p?\ > |P fc Im(pf )| > \P k \ 5/2 and similarly |l + P k Pi k \ > \Pk\ 5 so that 



PkPlk PkP? 



1 + P kP " 1 + P k pf 



< 



\F 


'k\ 


\n N - n N 
\Pi,k Pi 




\Pk 


2 


P N 1 




n N - n N 1 
Pi, k Pi \ 




i + M* 


i | 







5 \ 5 

Taking the maximum over k and using (181) gives (187) 
From (186), (187), and Lemma 4 we obtain 



Z ll X pPl £ l,l = pi 1 -^,!) ( 189 ) 



where 



S 



1 A PT 



N 



^aTK^ (190) 



For future reference, note that from (184), following the proof of (189) gives 

K p lfl 

Pkplk 



>KHS W ]4g T ^x a (p-^ 1 ) (191) 
1 1 K n N 

itrKHS)*^ 1 ] = x apf ^ (192) 

To prove the remaining equations in Theorem 1, we consider another expansion of the 
correlation matrix R, this time to remove the n* transmit dimension, < n < N, as described 
in Section III. 

R = {H tn S tn + d n v n sl )A 2 (H tn S tn + dnVn&J - zI M (193) 
= R t „ + d n u n vl + dnVnul + d 2 n c n y n \ ] n (194) 

where u„ and c n are defined after (166) and above (173), respectively. 

We now apply Lemma 6 to (194), where Yn, Xjv, vn, u N , and cn in the statement of 
the Lemma correspond to R, R tn , d n u n , v n , and c n , respectively. We shall now verify that the 
conditions of the Lemma are satisfied. For any n < N, since H| n v n = we have R( n v„ = — zv n 
and R| v n = — z*w n , and moreover 



vtR-Vn = -z- 1 (195) 
ut R-> n = vtR"> n = (196) 
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where vJjR^ v n corresponds to un in the Lemma, and (196) satisfies condition (104) of the 
Lemma. Since |v n | = 1, condition (105) is satisfied, and along with (180) and (183) satisfies 
condition (106). Note that r^ n , defined in (166), corresponds to vn in the Lemma. Therefore, 

R V -l + ^-(^-c n) (197) 

V 1 (-dnT* n V n + (1 - d 2 n C n Z- l )\l n ) 

i + e-W,-^) ( ' 

inf min |l + dlz' 1 ^ - c n )\ > , a.s., (199) 

which we shall now use to derive (18) and (19). 

With i.i.d. S, we see from (170) that (191) gives an expression for r-f . For isometric S, we 
use S^S = Ik, and 

1 1 1 N 

-trKHSA 2 )^- 1 ] = -trKHSA^JR- 1 ] = - £ trKHSA^jtR- 1 ] (200) 

n=l 

1 - 

= AT X, tr (( U " + dnCnVnYR- 1 } (201) 
n=l 

where we have used HSA 2 s„ = u n + <i n c n v n . Continuing with the preceding application of 
Lemma 6, we may use Lemma 7 to determine an equivalent asymptotic representation of the 
argument in the sum in (201), where An in the statement of Lemma 7 corresponds to (u n + 
dnCnVnY- That is, using (196), (197), and (198), we note that the terms corresponding to e N 
and are both asymptotically equivalent to — d 2 n c n z~ l T^ n , while the terms corresponding to 
un, vn, and trA^X^ 1 ] are asymptotically equivalent to d 2 n (? n z~ 2 , o^(t^) 2 , and rfj n — d\(? n z~ x , 
respectively. Therefore, after some algebra, we obtain from (148) 

r N -c„, 

tr[(u n + d n c n v n f~R7 1 ] x c n + - 1,n " (202) 

1 + <P n Z-\T? >n - Cn) 

and from Lemma 3, (170), and (173) we obtain 

rf!n - Cn n _ - Op 

Cn 1 + d? n z-^ n -c n )~ ap+ l + dlz- x (rf - ap) ^ 
noting that the bounds required for the application of Lemma 3 are satisfied by (182), p < oo, 
and (199). We therefore obtain from (191), (201), (202), (203), and Lemma 4 that 

a(p - 5£) x ap + /T (rf - ap)(W^ + - 1) (204) 



or equivalently, 
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f3*N f3*N _ _ x , 2 



where 

1 ^ d%> 
^ 1 = p£ 1 + rf^-i( r f - ap) (2 ° 6) 

For i.i.d. S, using (169) and (196)— (197) in the same manner as the derivation of (204), we 
have that 

n=l n=l " x >" 

x -/FaT 1 ?^ (208) 

and similarly from (169), (192), and (208) we obtain for isometric S 

Pi = — {-(5*z- l H 1A - a Pl £ 0>1 ) (209) 
1 — a v 7 

We now simplify the preceding solution by noting that the identity -jgtr[RR -1 ] = 1 may also 

be expanded in the dimension N, as opposed to the dimension K in (189). That is, 

1 + z ^=mH ^4A 2 (Sl n Hl n + rf n s n vt )R-V n (210) 

n=l 

Applying (196) and (197) to the argument to the above sum gives 

§ t a*( S ; „ H t + rf „ g „v»)R-v„ * (u - + ^:s>vr" u " ) <21 » 

i -|- a n z \r l n c n ) 



1 



1 + tftz-^rf - ap) 
and so applying Lemma 4 to (210) with (212) gives 



(212) 



1 + ^x^(1-^) (213) 

We now use (213) and (189) to simplify (205) and (209) in the case of isometric S. Combining 
(213) with (189) gives 1 + P*{H^ X - 1) x 1 - 0(1 + z 7 f ), which combined with (205) gives 

rfxap- ^ (214) 

Similarly, combining (189) with (209) gives 



1 - /5(1 + 



It follows that along a realization for which (158), (159), (189), (190), (191), (206), (208), 
(214), and (215) hold, \Gg{z) — Ti | -> 0, |pf -> 0, and |7f -Ti| -»• 0, where 71, pj, n G 
C+ are solutions to (17)-(19). 
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Appendix III 
Proof of (169) and (170) in Appendix II. 



Here we show max k < K \p^ k — p^\ 



.TV 



and max n <jv \ry n — T- 



N I 



for j — 1 in the 



limit considered. The remaining cases j = 2,3,4 are shown in an identical manner using the 
same results as outlined below. 



Define 



n N ' 
Pl,k 



TV" 
Pl,k 



n dfc 



^trptR^H] 



N-K 
1 

N-K 



trfn^HtR^Hl 



trpHtR-iH] 



, i.i.d. S, 
, iso. S, 
iso. S, 



n + s fc si 



(216) 

(217) 
(218) 



From Lemma 9, Lemma 10, and the Borel-Cantelli lemma, we have p± k x pf fc in the limit 



,7V' 



considered. For isometric S, we obtain \pf k — p^ k 



~K \Pik\> which with (181) gives p±' k 



p±' k . Finally, from Lemma 8 we have I pf — pf k I < 



TV 



.,7V" I 



N-K 
Im(z)TV 



for i.i.d. S, and < 



Im(z)(N-K) 



for 



isometric S, and hence p™ k x p^ . Putting these together, we have p^ k x p^' k x p± for i.i.d. S, 

and pf k x pi' k x p^ k x p^ for isometric S, as claimed in (169). 

/v 



Turning our attention to r\ n , for i.i.d. S define 



< = ^tr[(H t „S tr A 2 )*R^] 

< = ^tr[(H t „S^A 2 )tR- 1 ] 



(219) 
(220) 



Firstly, t[ n x r\ n from Lemma 9 and the Borel-Cantelli lemma. Now, applying Lemma 7 to 

1 



l,n 



T 



TV 



HCH^S^A 2 )*^- 1 -!^ 1 )]! 



(221) 



and using (181)— (182), and (199) it is straightforward to show r- n x r j n . Also, 



.TV" 
l,n 



T 



.TV 



-trtR-^fH^S^AV-CHSA 2 )*)] 



— tr[R 1 (d n u n vl l + d n v n u~l + d 2 n c n v n vl)} 



< 



(2rf n |u n | + d\ \c n \) 



(222) 



iVIm(z) 

where u n and c n are defined in Appendix II- A, and we have used |v n | = 1 and (174). It is clear 
from (175) and (180) that the terms inside the bracket of (222) are uniformly bounded above 
over tV and n, so r/^' x rf. Moreover, as r^ n x rf^ x rj^" x rf we have (170) for i.i.d. S. 
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To show (170) for isometric S, define 



T l,n,m ~ U n,rn~R-t n u n,m 



T 



N' 

l,n,m 



for m, n G {1, . . . , N} with m ^ n and where 



Ht„, m St^ m A 2 s r 



U« ^m^j^A S n V m 



Firstly, note that 



max 

m,n<N,(mj^n) 



I .A. I > 



max 

m,n<N,(mj^n) 



0, Vm,n<iV 



(223) 
(224) 

(225) 

(226) 
(227) 
(228) 



where (226) and (227) can be shown using standard arguments after writing s m and s n in the form 
described in the discussion following (173). Also, (228) is shown in the same way as (196). We 
now focus on a realization for which (226) and (227) hold. Now, max m!n < N ^ m7 L n ) \rf n — r^ nm \ — > 
since W^ n — rf nm \ < 2Im(2:)~ 1 Js^A 2 ^) \d m \ |u n |, to which we apply (175), (180), and 
(226). 



Writing T^ n , m -r^ m = tr[u^ m (R in 1 -R tn 1 ? J], we have that max 



N 



N> 



m,n<N,(m^n) | 'l,n,m T l,n,m\ 

from Lemma 7, since the terms corresponding to un, vn, £ , and in the statement of the 
Lemma uniformly converge to zero (independently of m and n) due to (199), (227), (228). 
Define 



N 



e n el + e m el+ ^ e e e\ 



(229) 



where e n is an N x 1 vector which contains zeros except for a 1 in the n th row. That is, ^ n>m 
is simply the unitary permutation matrix which swaps the n th and m th entries. Also, recall from 
the discussion following (173) that S may be written S = ©E^, and thus s n = E^0^e n . Now, 
note that 



-TV' 



' l,n,m l.m.n 



N' 



E 
E 



trK^S^A^Gten)* - (H in , m S in , m A 2 E / ,ete fn )^Ri 
tr[((H t „ im S^ m A 2 E i ,et* nim e n )t - (H t „ im S, n , m A 2 E i ,0tvl/ n>m e rn )^R- 

(230) 
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where in the second step, we used the unitary invariance of @ to substitute with ty n>m @ 
throughout the previous expression, noting also that this has no effect on S tm n and hence also 
R*n, m - Therefore, max min < JV , (m ^ ) |r^ m - r^'J -> 0. 

Combining the above preceding gives m.wt m ^ n < N ^ m ^ n ) \rf n — rfU — > 0. Moreover, 



1 N 
m=l 



and hence T Xn x r^. 



Appendix IV 
Alternate MMSE SINR of Section IV-B 

Note from (184) that the filter R^h*. has the same SINR as R _1 h fc . The associated signal 
and interference powers are P k \Pik\ 2 an d Pit + a2 P2k respectively, as defined in Appendix 
II-A. It is easily shown that the latter term simplifies to p^l (the complex conjugate of Pi k ), 
and hence the MMSE SINR is P k pi ik . Namely, 

Plk + ° 2 plk = hlR^Rd, " ^ 2 I A /)R^h fc + ^htR^R^h* (232) 
= hlHjfh* = p$ (233) 

We now seek expressions for each of the variables which enter the interference power, without 
using the preceding simplification. Firstly, note that in extension to (195)— (196), 

vtR-XN = M~ 2 ( 234 ) 
vtxgV = ut Xg> v n = , j = 1, 2, 3. (235) 

for n = 1 ... N. 

Considering terms which arise in p® ', defined in (169), we have from (197) and (184), and 
additionally using (234)-(235), 



N 



1 j_y-/3*iv .2 X (j) 7 = 23 

n= (236) 

£ Ef=i PfchtR-ttfR- 1 ^ , j =4. 

/3*Nr 2 (W? 2 + r 2 ^ 2 ) , j = 2, 

(3* \z\- 2 (I + t 3 n )H% 2 , j = 3, (237) 

> •? = 4 - 
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which corresponds to pf for i.i.d. S. Additionally note that 



itr[(HS)*XW] = lX) h I* £, " )h * x < 



k=l 



a H2 c 0,2 

an N f N 
JV cN 



Combining (237) and (238) according to (169) for isometric S gives 



3 — 2, 
3 — 3, 
j = 4. 



(238) 



a(S N 2 - 1) + 1 
a(S N 2 - 1) + 1 
- 1) + 1 



, 3 — 2, 
, J = 3, 
, J =4. 



(239) 



Considering terms which arise in r ? - , defined in (170), we have from (184) 



ltr[(HSA 2 )^0')] = lX:P fe 2 htxO')h fc x apfS N 



j -2,2 



(240) 



fc=i 



for j = 2, 3, which corresponds to for i.i.d. S. Additionally, note that 

1 1 N 

-tr[(HSA 2 St)iX^] = - ^(u n + d nCn v n )tX^(u n + d n c n v n ) 



(241) 



T2(/3*(H$ 2 + + \z\- z ap - 7f W£ 



^(/3*(< 2 - 1) + 1) + f i*r «p - r i 



iV 1 



2,2 ) 



J = 2, 

J = 3. 
(242) 

Now for isometric S, the left hand side of (240) and (242) are equal since S^S = Ik. Hence 
equating these expressions and solving for gives 



(ap?£&-/r\z\ 



\a p — t{ 



N 



2 n/N 
n i,2 



/?*«2-l) + l 

a P3 £ 2,2 ~ P* \ z \~ 2 \ap-r- 



N 



n N 



(243) 



2.2 



It follows from the above and Theorem 1, that along a realization for which (158), (159), 



(237), (238), (240), and (242) hold, |pj v - pj\ -> 0, j = 2,3,4, and |rj v - Tj\ -> 0, j = 2,3, 
where pj and r, are solutions to (25)-(28). In addition, since p^ k + cr 2 p 2k x p^ + cr 2 p^ and 



pffc x Pi ■> me asymptotic SINR is given by (24). 



jv 
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Appendix V 

Proof of (47): Convergence of the e.d.f. of W for Exponential Weighting 

With exponential weighting, the e.d.f. of the i x i matrix W corresponds to the distribution 
of a random variable, Wi, which is uniformly distributed on the set {e^ 1- - 7 /^ : j = 1, . . . ,z}, 
where e = (1 — -^) is the exponential weighting constant. The corresponding distribution function 
is Fwi(w). To prove convergence in distribution of the a.e.d. of W to Fw(w), we show that 
linij^oo F w .{w) = F w (w) where F w (w) is given in (47), i.e., pointwise convergence. We have 

F w .(w) = Pr(Wi <w)= Pr(e i(1 " Jl) < w) = Pr ( J, < 1 - (244) 

\ i me J 

^ 1 + ^ W 1 (245) 



i ^— ' \ i z In e 

j=i 

for e % < w < 1, where J, is a discrete random variable uniformly distributed on the set {j/i : 
j = 1, . . . , i}, and where u(t) is the step function, i.e., u(t) is zero for t < and unity for t > 0. 
Now, 



log(l - linwf log(l-^) lin^oo 
lim dne = lim = — t. — = — = -r] L 



and similarly, the limit of the lower bound on w simplifies to Hindoo e l = e r] l L . Taking the 
limit of the Riemann sum in (245) then gives 

f 1 L 
F w (w) = lim F w .(w) = dt = 1 + -Inw (246) 



v 



where we have also used lim^^ F Wj (u> = 1) = 1. This establishes that as i — > oo, F^w) 
converges in distribution to F(w) = 1 + - \nw for e~ v ^ L < w < 1. 

Appendix VI 
Proof of Theorem 3 

As in Appendix II, the analysis in this appendix is based on removing a single dimension from 
matrices and vectors, as described in Section III. We will use t n when removing the n th transmit 
dimension, d k when removing the k th data stream, and r m for removing the m th received symbol 
interval. 

We define x, x, and x according to Definition 2 in Appendix I, where the maximum is 
over k < K, n < N, and m < i, respectively, and the limit is as (M, N, K, i) — > oo with 
K/N -> a > 0, i/N — ► r) > 0, and M/N -> (3 > constant, as described in Section II-E. 
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A. Definitions 

As in Appendix II, throughout this appendix, we substitute H with VD without loss of 
generality, where V is an M x M Haar-distributed unitary matrix, D is a diagonal M x N 
matrix containing the singular values of H, and define v n and d n . The justification for this 
substitution in this case will be established later in Proposition 3. 

Define the following quantities 

(247) 
(248) 
(249) 
(250) 
(251) 
(252) 
(253) 



r N 


- -r f ± ij) r 


, for j 


= !,■■ 


•,4, 


< m < i 


lj,m 




, for j 


= !,,- 


..,4., 


< m < i 




= w m X r Ww m 


, for j 


= !,■■ 


•,4, 


< m < i 




= ht*<fh* 


, for j 


= !,■■ 


-,4, 


< k < K 




= qlX^q* 


, for j 


= !,■■ 


-,4, 


< k < K 




= rt±Sr n 


, for j 


= 1,2,3, 


< n < N 


-In 


= 


, for j 


= 1,2, 


3. 


< n < (3*N 



where 



n, 



and 



h fe = Hs fc 
1 



OA: 



-7^ WBAi 



X(i) 



w m = — =HSAb r 



R 1 

R tR -i 

R -tH*R 1 

R-t(HSA)*R- 1 



3 = 1, 
3=% 
J = 3, 
J=4 



(254) 

(255) 
(256) 
(257) 
(258) 



(259) 



** (7) * (i) * (i) th 

Further, we define Xr„, Xj , and X^ by removing the contribution of the m training symbol, 
the k th data stream, and the n th transmit dimension, respectively, from as follows: 
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R, fc = y^Wft^ - zI M (261) 
K d =HS dfc A, fc BL+N (262) 



• To remove the contribution of the received training symbol interval, for some < m < i, 
replace R and 1Z with R rm and lZ Tm , respectively, where 

Rr m = 72r ra W rm ^J m - z\ M (260) 

and lZ Tm and W rm are 1Z and W with the m th column, and m th row and column removed, 
respectively. That is, R rm = R — ^r^. 

• To remove the contribution of the k th data stream, for some < k < K, replace R, 1Z, S, 
A, and B with R dk , 1Z dk , S dk , A dk , and B dk , respectively, where 

1. 

i 

^d k ^d k o dk 

and S dk and T5 dk are S and B with their k th columns removed, respectively, and A dk is A 
with the k th column and row removed. 

• To remove the contribution of the n th transmit dimension, for some < n < (3*N, replace 
R, TZ, H, and S with R tn , TZ tn , H tn , H| n , and S tn , respectively, where 

K tn = l -K t W1Z\ n - zl M (263) 

7^ = HJSt.ABt + N (264) 
H tn =V tr D tn , (265) 

and \ tn and S tn are V and S with the n th column and row removed, respectively. Also, 
D+ is D with the n th column and row removed. 
Considering (248)-(253) asymptotically, it can be shown using the same steps as in the proof 
of (169) and (170) in Appendix III that 

" r& S 7f = (266) 



(3a 2 IJ ' m " J M 

r j>m ~ r j = uj + a - lj (267) 

s 

±tr[(HSA)*X^] , i.i.d. B, 



lEU^m ,iso.B,a<r / , (268) 



I i(K-i) 



trfTAtS+HtX^JHSA] , iso. B, a > rj, 
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and 



Pi,k - Pi 



itr[H*X^] , i.i.d. S, 

^trfTEHtX^H] , iso. S, 



(269) 



$1 * t 



N 



£tr[(ftW)*Xtf>] , i.i.d. B, 

^tr[YWfttxWftW] , iso. B, a < rj, 



(270) 



J2k=l ^f,k 



iso. B, a > i], 



and 



T- X T- 

J>" 3 



3,n "3 



where II is defined in (171), and 



^tr[(ftWBA)tX&'>] , i.i.d. S, 

— 

n=l 



li ~ iBBt 



(271) 



(272) 



I x - iB+B 



(273) 



a < T), 
K " - , a > 77. 

Although the derivations of the uniform asymptotic equivalence in (269)-(272) are not shown, 
note that they rely on expressions derived later in this appendix, namely (307)-(309) and (326)- 
(328). 

Note that (172) again applies, and also 



w k x w 



n 



awp 



ap*r]' l pE[H]+a 2 pr]~ l > 



for < k < K, < n < N, and < m < i, and where 



b.IWb fc 



z n = -§t ABtWBAs n 



(274) 
(275) 
(276) 

(277) 
(278) 



p = E[P], and w = E[W]. (274) is shown in identical manner to (172) and (173), in this case 
using (160). (275) can be shown in an identical manner to (173), also using (274), i.e., 



c n x 4tr[WBA 2 Bt] 



1 K 



■ k w k x awp 



(279) 



fc=i 
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Finally, (276) can be shown from fr^r™ x ±tr[(HSA)* + a 2 l M ] using (158), (172), and 
i \nlu m \ 2 x tr[(HSA)t] x ^E[P]E[H\. 

We now give several bounds on particular matrix and vector norms which are required in 
order to apply Lemmas 2 and 3 later. Firstly, the assumption that z <E C + gives 

IIR" 1 !! < Im^)- 1 (280) 

Recall that assumptions on H, S, and A outlined in Section II again give (175)— (179), and 
additionally the assumptions on W, B and N give 

sup ||W|| < oo (281) 

N 

-\\B\\ 2 ^ (1 + v^T 1 ) 2 , (i-i.d. B) (282) 

i 

I|| N ||2 ^ a 2 (l + v^T 1 ) 2 (283) 

i 

i|b fc | 2 xl (284) 
-|b m | 2 x ar," 1 (285) 

| 2 x ^ (286) 

i 

where (282)-(283), like (177), is due to [33]. Of course, ^||B|| 2 = 77* for orthogonal B. 
Moreover, (175)-(179), (280)-(286) imply 

1„ „9 

sup— 7?. < 00 , a.s. (287) 

N % 

sup maxmax{|hfc| , |q^| , Wk} < 00 , a.s. (288) 

N k < K 

supmaxmax{|T n | , \c n \} < 00 , a.s. (289) 

n<N 

sup max{ I f ^ I ,max Ir^l} < 00 , a.s. , for j — 1 . . . 4, (290) 

J m<i J' 

sup max{ I pf I , max I pf k I } < 00 , a.s. , forj = 1...4, (291) 

N 7 k<K 

supmax{ -0- ,max ipf k } < 00 , a.s. , for j = 1 . . .4, (292) 



i 

1 . 

- n r 



N 



k<K 



supmax{ f ■ , max f n } < 00 , a.s. , for j = 1 . . . 3, (293) 
sup max{ J | , max | z>j^ | } < 00 , a.s. , for j = 1 . . . 3, . (294) 



iV 



With the additional assumption that \z\ < 00, we also have 



sup ||R|| < 00 , a.s. (295) 

N 
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B. Derivations 

Note that 7^ = -^tr[R _1 ] is the Stieltjes transform of the e.d.f. of the eigenvalues of ^TZWTV. 
The proof of Theorem 3 proceeds as described in Section III, by applying the matrix inversion 
lemma to remove the effect of the m th training interval from R, < m < i, and applying Lemma 
6 to remove the effect of the k th data stream and n th transmit dimension from R, < k < K, 
< n < (3*N . The removal of each dimension generates a pair of variables; expressions for 
which are then derived. Additionally, these results are applied to expansions of the identity 
RR -1 = I to yield relationships between (z) and the variables generated. 

1) Expanding Dimension i: From the matrix inversion lemma, (247), and (260) we have that 

R -1 r m = i? m R-> m (296) 
where R m = 1/(1 + w rn f^ m ). In addition, since 

w m R r m n m - n^R^Wm X (297) 



due to i 



m r 



; <7 2 d> 2 , we have from the matrix inversion lemma and (297) that 

RT 1 ^™ x RmR'l ((1 + w m ^ m )u m - w m u^ >m h m ) (298) 
R _1 n m x R m R~l (-WmTiV" 1 + 0- + w m^>^ m )n m ) (299) 
Now, we expand the identity R _1 R = Im along dimension i using (296). We have that 

1 = -Ur^R] = -ztf + ijY — — — — ^r^- (300) 
In an identical manner to the proof of (187), 

1 + w rn r^ m 1 + w m rf 
That is, we use (267), and note that (276) and (295) satisfy conditions (95) and (96) of Lemma 
5, which gives a uniform positive lower bound on Im(f f^J over % and m < i. Hence r^ m E C + 
almost surely, and from Lemma 4, 

where the expressions in the right hand side of (302) are related via (54), and 



K* = ~ E n~^W (303) 
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2) Expanding Dimension K: In this case, we write 

1 



R 



-(K dk + A k h k h} k )W(TZ dk + Afchfebj^ - zI M 



R dk + A k q k hl + A k h k ql + w k P k h k h 



(304) 
(305) 



We now apply Lemma 6 to (305), where Ytv, Xtv, vl n , v n , and c N in the Lemma correspond 
to R, R dfc , h k , A k q k , and P k w k , respectively. Therefore, u N and v N in the Lemma correspond 
to pf k and Pki>\^ k , respectively. Note that 



tD-ll 



(306) 



due to N 



h^R d 1 qfe x ipz k and (292) for i.i.d. S, and similarly with isometric S, it can be 



shown (N — K) h^R^q^. has a finite uniform upper bound. And so, (306) with (176) satisfies 
condition (104) of the Lemma. Additionally, (172), (288) and (295) satisfy conditions (105) and 
(106) of the Lemma. 

Therefore, from Lemma 6 we have 



RT 1 !!, x K k R^(h k - A k p^ k q k ) 

R T V x K k R^(-A k ^ k h k + (1 + w k P k p? k )q k ) 



inf min 

N k<K 



> , a.s. 



(307) 
(308) 
(309) 



where K k = 1/(1 - P k p» k (^ k - w k )). 

Now, we expand the identity R _1 R = Im along dimension K using (307)-(308). We have 



1 + z xft = ^trfR^HSABTW^ 1 ] + — trfR^NW^ 1 ] (310) 

Expanding the first term in (310) with respect to dimension K using (307)-(308), we have 
1 1 K 

— trfR-^SABtW^] = — ^A^W^Rr 1 ^ (311) 

k=l 
1 K 

— A k K k (A k w k h k + q^R^hfc - A k p^ k q k ) (312) 



4 7 tr[R" 1 HSAB t W7e t ] + -^rtrfR^NW^] 

iM L J iM 1 J 



iM 



AI 



k=l 
f(l-^l) 

where we have used (269), (270), (274), (306), and (309), and define 

K 



(313) 



£ 



N 



1 5> 



pp 
r k 



(314) 
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Now considering the second term in (310), using (296), (297), and (266) we see 

^rtrtR^NW^t] =±-Y w ^ + n ^r> m x a 2~N W N (315) 

Combining (310), (313), and (315) gives 

0(1 - 7i>f) x a(l - S^) = -ap?(4>? - w)£^ (316) 

Note that up until this point, none of the analysis has relied on the fact that we have made the 
substitution of H with VD. Therefore, we are in a position to state the following proposition, 
which shows that this substitution is valid. Note that, the analysis in the remainder of this 
appendix relies on this substitution having been made throughout. 

Proposition 3: For the model (1), the distribution of both the Stieltjes transform of the e.e.d. 
of jlZWTV and the asymptotically equivalent ALS SINR given in (31) are invariant to the 
substitution of VD for H, where V is an M x M Haar-distributed random unitary matrix, and 
D is a M x N diagonal matrix containing the singular values of H. 

Proof: Using (307)-(308), we may derive the asymptotically equivalent form of the ALS 
SINR given in (31) (see Appendix VII), which depends on and xj)™ , j = 1, . . . , 4. Now note 
that the distributions of -yf, p 1 ^ and i/j^, j = 1, ... ,4 are unchanged by the substitution of H 
by VD. That is, let T be an independent M x M Haar-distributed random matrix. Then, 

7 f = ^[Rr 1 ] = ItrfTTtR- 1 ] = ^tr[((TO)W(TW)t - zl M )^] (317) 

Note that Til = THSABt + TN, and so, writing HS = (TU M )D(Uj v S), where Ua/DU^ 
is the singular value decomposition of H, the unitary invariance of T, S, and N implies the 
result for the Stieltjes transform of R. A similar treatment of p^ and ip^ gives the result for 
the asymptotically equivalent form of the ALS SINR given in the right-hand side of (31). ■ 
3) Expanding Dimension N: Writing 

JV 

TL = 4v n (BAs n )t + N (318) 

n=l 

and substituting this into (8), in a similar manner to (305), we have 

R = R tn + dnVnT^ + d n T n w ] n + ^c n v n v^. (319) 

We now apply Lemma 6 to (319), where Y N , X^r, u^v, v N , and in the Lemma correspond 
to R, R tn , v„, d n T n , and d\c n , respectively. Therefore, un and v n in the Lemma correspond to 
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Ui n and d 2 n f^ n , respectively. We have (289), (295) and |v n | = 1, which satisfy conditions (105) 
and (106) of the Lemma. We now show 

vj, R t "> n x TtR^v n x (320) 

which, with (175) satisfies condition (104) of the Lemma. To see this, note that v^r n x 0, since 

N |vtr n | 2 = ^(vtNWBAs„)* x i-tr[(NWBA)*] x ^E[P]E[W 2 } (321) 

Also, since Hj n v„ = 0, we have 

R tre v n = T^WNtv,, - zv n (322) 

From (322), we find 

1 1 i 

ZT*jL£v n x -rl B^1l tn WNt v „ = ^n^v^t R,;> m (323) 

m=l 

Considering the argument in the preceding sum, note that 



w m a 2 rtR*_ 1 v K 



w m n^v n r^R tri r m x — — ^ (324) 

1 + w m r? t 



where we define 7^ t , rf t , and u)f t from 7^, r{ , and u)f in (266), (267), and (268), respec- 
tively, with the contribution of transmit dimension n removed, as explained in Appendix VI-A. 
Returning to (323), and additionally using 1 + 27^ x r^ tn jf J2ln=i 1+ w ™n , which is shown 
in an identical manner to (302), it follows that 



zu 



N 
'l,t* 

Now, since z, a)^ , and rj^ are in C + almost surely (through an application of Lemma 5), 
and also since rf^l is uniformly bounded above over N and n (which is shown identically to 
(290)), we have (320). 

So we may apply Lemma 6 to (319), to obtain 

R 1 v n x Vji^ (v b - d n ^ n r n ) (326) 

R~V n X KRuC-^fnVn + (1 + ^W)^) (327) 

inf min 1 1 - ^(f^ - c„) | > , a.s. (328) 

N n<N 

where K = 1/(1 - d^Jf^ - c n )). 
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Now, we expand the identity R 1 R = 1m along dimension N using (326)-(327). Continuing 
from (310), we may expand the first term along dimension N to obtain 

1 1 N 

— trfR^HSAB+W^] = -V rf n sJ l AB t W^ t R- 1 v n (329) 



iM 1 1 iM 

n=l 



1 N 

— ^2 d n V n (d n c n v n + T n ) t R^ i (v„ - d ) (330) 



M 

n=l 



1 N 

TjE^K^Jfin-r^) (331) 



M 

n=l 



(332) 



where 



1 P * N d 2 P 
<f/N = 1 \^ C333 , l 

P,<? P*N ^ (1 - d**>f ft* - apw))9 V ^ 

and we have also used (271), (272), (275), and (328). 
Combining (310), (332), and (315) gives 

0(1 - 7f /i>f ) = - 7%) = -i>f (ff - apw)?^ (334) 
4) Identities via moment definitions: In order to find expressions for u>*, p*, ^f, f *, and 
we now apply (296), (298)-(299), (307)-(308), and (326)-(327) to expansions of the definitions 
of these variables, or expressions related to them. After applying (296) or (298)-(299), we shall 
use (266), (267), (268), (297), and (301) in conjunction with Lemmas 3 and 4 to further simplify 
the resulting expression. Similarly, after applying (307)-(308), we shall use (269), (270), (291), 
(292), (274), (306), and (309) in conjunction with Lemmas 3 and 4 to further simplify the 
resulting expression. Finally, after applying (326)-(327), we shall use (271), (272), (293), (294), 
(275), (320), and (328) in conjunction with Lemmas 3 and 4 to further simplify the resulting 
expression. 

Firstly, consider uj^ in (268). From (307)-(308) we obtain 
1 1 K 

-tr[(HSA)tlT x ] x - PkKktifi£ (h* - A k ff q fe ) x -pf £* (335) 
1 1 k=i ^ 

which corresponds to Cj^ for i.i.d. B. For orthogonal B with a < rj, we use (335) and jB^B = I K 

to obtain 



pUix ~ ^[(HSABtjtR" 1 ] x-V ^R- 1 ^ (336) 

m=l 

x <(1 - cDfWfO (337) 
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which is proven in an identical manner to (302), also using (298), and may be simplified using 
(302) to give 

a ~N cN 

< * / , v 038) 

Now if a > r), we have directly from (335) and (337) 

= «^^ tr r TA t S t H tR-iHSA] (339) 
Tj K — i 

a ("fteSi-^ii-tfwzS) 040) 



a — 7] \rj ' a 
Combining this with (302) gives 

t ~NcN 



a~N?N 
N V> 



(341) 



1 l-f(l-7f/^)' 
Now we consider tf, defined in (271), using (307)-(308). Firstly, note that }7£Wb fc = 

\(TZ dk + A k h k hl)Wh k = q k + A k w k h k , and hence 

1 1 - 

— tr^WE^R" 1 ] = — PkhlwrtR-'nWb, (342) 

fc=i 
1 K 

= j7 2^ Pfc ( qfe + Ah^M^R X (qfc + A k w k h k ) (343) 

k=l 



a J7 P * K S? - ™ p kPx$i ~ w)) (344) 



fc=i 



x a(wp + « - w)E? tl ) (345) 

which corresponds to ff for i.i.d. S. To find an expression for with isometric S, we use 
S^S = Ik and (345) in reverse, i.e., 

a{wp + - w)£") x -^-tr[(7eWBAS t )*R- 1 ] (346) 

1 N 

T7 Yl V '^ Tn + ^nC n v n ) t R- 1 (r n + rf n (c n - f 1 7V n )v n ) (347) 



iV 

n=l 



1 * 

^ W) (348) 



N 

n=l 



x apw + (ff - apw) - 1) + 1) (349) 

where we have used 47£WBAs n = j(T^t n + d n v n s n ) WB As„ = r^ + d^v^. Moreover, using 
(334) to simplify (349) we obtain 



ff x apw + - — , ' /I' (350) 



1 - 0(1 - Pf) 
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Now we consider pf defined in (269), using (326)-(327). Firstly, 

f3*N (3*N 

-trfHtR- 1 ] = - d 2 y n K-^ n x - £ d 2 n V n ^-\ n (351) 

n=l n=l 

x vF/rn^ (352) 

which corresponds to p^ with i.i.d. S. Note that -^tr^HS^Rr 1 ] x apfS^ follows from an 
analogous derivation to (335). Therefore, combining this with (352), we obtain for isometric S, 

Pi=Y^ (^P^ 1 ] " ^trKHS^RT 1 ]) (353) 
(/^Wfi - aft 4^i) (354) 



1 — a 

Moreover, combining with (316) gives 



1-/5(1- WW) 



Pi- 

Now we consider tp± , as defined in (270), using (296). Firstly, 



itr[(7eW)*R- 1 ] = - V - m % x w - )< (356) 



which corresponds to for i.i.d. B. Note that ^tr^WB^Rr 1 ] x a{w + (^f - w)^) 
from an analogous derivation to (345). Therefore, combining this with (356), we obtain for 
orthogonal B and a < rj, 



7) — a ' rj 

Moreover, combining with (316) gives 



f ^trffftW^R" 1 ] - 4tr[(^WB) t R- 1 ] J (357) 
^ _ y^r _ ^ + _ ^^j) (358) 



rj — a 



'/ 



W = w s ^7 — (359) 

i-!(i-7fM 7V ) 



Finally, for orthogonal B and a > r], from (356) and -^B^B = Ij, we have 

w - Wfi x trffftWB^RT 1 ] (360) 
x u) + (^f - (361) 
following an analogous derivation to (345). Moreover, combining with (316) gives 
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To derive an expression for z>i, we start with (322) and obtain 

1 - 

1 + z Kn = ~ u; ^ n L v nv] l R t ; 1 r m (363) 
for which, like (324), we have 



i 



„, n t vv tR-i r 5 WmU llg (3 6 4) 

1 + iu m rf 



where we have additionally used r^ mt x r± m x f^, which follows from Lemma 7, continuing 
on from the application of Lemma 6 in Appendix VI-B.3. Therefore, from (363) and (364), we 
obtain 

1 + zv? x a^fWj (365) 

It follows that along a realization for which (158)-(160), (267), (313), (335), (338), (341), 
(345), (350), (352), (355), (356), (359), (362), and (365) hold, Ipf - pA ^ 0, Iff - fA ^ 0, 



0, \uj^ —uA 0, |z>f — z>i| 0, and Iff — fA — -> 0, where 71, pi, fi, 



$1, o>i, i>i, and fi G C + are solutions to (37)-(43). 

Appendix VII 
Proof of Theorem 2: Asymptotic SINR 

This proof continues on from the proof of Theorem 3 in Appendix VI. Firstly, note that the 
steering vector (10) can be written as 

{qfc + A k w k h k , ALS with training 
(366) 
h k , semi -blind ALS. 

That is, (307) is the equivalent asymptotic form for the semi-blind LS filter given in (7). Also, 
using (366), (307)-(308) we obtain the equivalent asymptotic form for the ALS filter with 
training. We can express the asymptotic form for both receivers as 

c fc x K k R-£(a% tl h k + af 2 q k ) (367) 

where 

{(1, -A k pF h ) , semi-blind LS 

(368) 
(A k (w k - t/>5J, 1) , LS with training, 

We now compute the large-system SINR for the filter (367) for stream k, and symbol interval 
m > i. For notational simplicity, we drop the subscript m in this appendix. Note therefore that 
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in this appendix b dk denotes b m with the k th element removed. Since Kf. cancels in the ratio 
when calculating the SINR, we ignore this constant. The signal component is 

A k c\h k b{k) x A k (aj^hfc + < 2 q fc ) ] R^h fc b(A;) (369) 

x A k a^Mk)^- d S (370) 

using (176), (291), and (306). The interference component is 

4 (HS dfe A dk h dk + n) x (o^h* + < 2 q fc ) f R"t (HS dfe A dk h dk + n) (371) 

x afi (htR^HS dfc A dk h dk + hlR^n) 

+ <2 (qlR^ HS, fc A dk h dk + qtR-tn) . 

Therefore, the signal power is asymptotically equivalent to P k \ a k ,i | 2 | Pi k | 2 . Also, the interference 
power, averaged over the data symbols and noise is asymptotically equal to 

|<r| 2 (/& + * 2 PL) + |< 2 | 2 + * 2 4V (372) 
Moreover, from (250), (251), (269), (270), we have 

p I N I 2 I - |2 

SINRf^ x --^ ; ^. (373) 

' |<il + ^) + l< 2 | W + ^ 2 <) 
Appendix VIII 
Steady-State ALS SINR with Windowing 

By inspection of Theorem 3 and Lemma 1, we note the following limits as rj — > oo: 

7,- = lim <^ (374) 
^"[W ,3 = 2,3,4. 

fwi , j = 1, 
(375) 

<V?7 > J = 2,3,4. 



77-^00 



fj = lim <( (376) 
rj/v , J = 2,3,4. 



rj—too 



and 



= lim { (377) 




^ = lim { (378) 
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and 



,.7 = 1, 

i/j = lim { (379) 



V~ > °° I o / 2 



,j = 2,3. 
77T1 , j = 1, 

r,- = lim { (380) 



77— >oo 



Tj , j = 2, 3. 



and 

to = lim rjw (381) 

77— >oo 

W m ,„ = lim r/W m , n (382) 

JJ— »oo 

£ m ,n = lim £ m ,n (383) 

77— >oo 

7Y min = lim 7T! min (384) 

77— >oo 

Moreover, the variables jj, pj, fj, ipj, Uj, Vj, fj, £ m ,n, 'Hm,n, and W m ,n satisfy the same set of 
equations as 7^, pj, fj, ipj, uoj, f>j, fj, £ m>n , H m ,n, and W m , n , respectively (for appropriate values 
of j, m, and n), if we set z = —p, r\ — 1, and assume that B is i.i.d.. 

Appendix IX 
Sketch of Proof of Theorem 4 

We provide a sketch of the proof for i.i.d. S only, mainly due to the fact that the equations 
are simpler to manipulate. However, the same approach is valid for isometric S. 

Firstly, according to the remark made after Theorem 3, with i.i.d. B and p = we have 
Wiipi = Pi- That is, the term p\ in the numerator of the asymptotic SINR in (31) can be 
written as -^1— SINRf MSE . 

Consider the denominator of the alternate MMSE SINR expression in Section IV-B with i.i.d. 
S. Solving (25)-(26) for p 4 + a 2 p 2 gives 

P4 + ^P2 = 2 , (385) 

Z — ap 02,2/12,2 

which from Appendix IV equals p\. 

The next step is to simplify the p^ + <J 2 p2 term in the denominator of the ALS SINR (31). To 
do this, we solve (55), (56), (59), and (65) to find p 3 , f 3 , z> 3 in terms of p 4 and p 2 , and then 
substitute into (55). We then solve (64), (56), and (59) for 72 f 2 , and t/j 2 in terms of p 4 and p 2 , 
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and substitute into (55). Combining these results gives 



Now we use the identity (71) to obtain 5 lj 



Pa + cr 2 p 2 




(386) 



Pi 



1 



SINR MMSE_ 



(387) 



Wi, 2 



PfcWi,iWi, 2 



Next we simplify the i/> 4 + o" 2 2 term in the denominator of the ALS SINR (31). To do this, 
we solve (64), (60), (59), (59) and (60) for 72, C02, $2, 4>4, and c2> 4 , and form the sum 



Substituting (387) and (389) into (31) along with the expressions for a^i and from (32), 
and simplifying gives (82) and (83). 
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